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MEMBRANE SCAFFOLD PROTEINS 

CROSS REFERENCE TO RELATED APPLICATIONS 

This application is a Continuation-in-Part of United States Provisional Application No. 
60/252,233, filed November 20, 2000. 
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This invention was made, at least in part, with funding from the National Institutes of 
Health (Grant Nos. (R01GM31756, R01GM33775, GM63574 and 5F32GM19024). 
Accordingly, the United States Government has certain rights in this invention. 

BACKGROUND OF THE INVENTION 
The field of present invention encompasses molecular biology and membrane technology . 
Specifically, the present invention relates to artificial membrane scaffold proteins (MSPs), 
sequences encoding them, vectors and recombinant host cells, methods for recombinant 
production of them, and methods of using the membrane scaffold proteins to stabilize, disperse 
and solubilize fully or partially hydrophobic proteins such as tethered, embedded or integral 
membrane proteins while maintaining the biological activities of those membrane proteins or to 
stabilize, disperse and solubilize membrane fragments. 

Several years ago we developed a new system for the study of membrane proteins by 
scanning probe microscopy, based on the adsorption of synthetic high density lipoprotein disks 
(rHDL, apo A-I) onto mica in an oriented manner (Carlson et al., 1997; Bayburt et al., 1998; 
Bayburt et al., 2000; Carlson et al., 2000). The diameters of the discoidal structures observed are 
approximately 10 nm with a height of 5.5 nanometers. The 5.5 nm high topology observed is 



1 



WO 02/40501 



PCT/US01/43451 



most compatible with a single membrane bilayer epitaxially oriented on the atomically flat mica 
surface (Carlson et al. s 1997). 

Purified membrane proteins can be reconstituted into the phospholipid bilayer domain of 
certain such discoidal structures and studied in solution or subsequently adsorbed on a suitable 
5 surface for either atomic force microscopy or examination by spectroscopic techniques that take 
advantage of a surface of oriented protein-bilayer assemblies. Additionally, the underlying 
discoidal structures containing the membrane protein are easily recognizable and provide a point 
of reference for judging the quality of the sample and images. A tethered membrane protein, 
NADPH-cytochrome P450 reductase, was incorporated and physically studied in rHDL bilayer 

10 1 disks (Bayburt et al., 1998; Bayburt et al., 2000). The reductase can be incorporated into 10 nm 
diameter rHDL disks, those disks can be absorbed onto mica, and the catalytic domain of the 
reductase, which protrudes from the top of the bilayer structure, can be imaged. The incorporated 
en2yme is active on such a surface, with a turnover number consistent with that obtained with 
particulate membrane preparations. Force curve analysis has been used to estimate the height of 

15 the domain and its compressibility under the force of the AFM probe (Bayburt et al., 2000). The 
height of the molecule above the bilayer surface corresponds to the predicted height based on the 
recent X-ray crystal structure (Wang et al., 1997). Cytochrome P450 reductase can be 
incorporated in active form in MSP-supported nanoscale structures of the present invention. 

High-density lipoproteins (HDL) are assemblies of a protein component, termed apo A-I, 
20 and various phospholipids. HDL particles play an important role in mammalian cholesterol 
homeostasis by acting as the primary conduit for reverse cholesterol transport (Fielding and 
Fielding, 1 991). The function of HDL as a cholesterol transporter relies upon the activity of the 
HDL-associated enzyme lecithin-cholesterol acyl transferase, or LCAT (Glomset, 1968; Jonas, 
1991), which mediates the insertion of cholesterol esters into HDL lipoprotein particles. Certain 
25 portions of the apo A-I protein are required for the activity of this enzyme (Holvoet et al., 1 995). 
In addition, a part of the apo A-I protein is thought to be in a globular domain at the N-terminus, 
and to be responsible for interactions with cell surface receptors. One nascent form of HDL 
particles has been assumed to be that of a discoid based on electron microscopy of stained 
preparations (Forte et al., 1971). Our laboratory has confirmed this using AFM studies of 
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synthetic forms of rHDL under aqueous conditions. This form, however, is not the predominant 
form in circulation in vivo. Rather, the apo A-I structure appears to have evolved to stabilize a 
spherical form. 

Two general models for the structure of HDL disks have been proposed. One model has 
5 the apo A-I protein surrounding a circular bilay er section as a horizontal band or "belt" composed 
of a curving segmented alpha helical rod (Wlodawer et al., 1979). The other model has the 
protein traversing the edges of the bilay er vertically in a series of alpha helical segments (Boguski 
et al., 1986). Both models are based primarily on indirect experimental evidence, and no 
definitive means of distinguishing between them has emerged. Sequence analysis of the apo A-I 
1 0 genes suggests that the protein folds into a series of helices roughly 22 amino acids long, which 
is consistent with roughly a bilayer thickness. The placement of the helices in the disks has been 
predicted by computer modeling (Phillips et al., 1997) and attenuated total reflectance infrared 
spectroscopic measurements (Wald et al., 1990). These efforts suggested the helices lie roughly 
parallel to the acyl chains and are slightly shorter than the thickness of a bilayer. This 
15 arrangement of proteins and lipid is consistent with the picket fence model. 

A belt model is consistent with some electron microscopy and neutron scattering data 
(Wlodawer et al., 1979), where the helices are arranged longitudinally around the edge of the 
bilayer disks like a "belt". More recent infrared spectroscopy studies using a new method of 
sample orientation for dichroism measurements are more consistent with the belt model, in 

20 contrast to earlier studies (Wald et al., 1990; Koppaka et al., 1999). So far, there is no 
compelling direct evidence as to which model is correct, even though a low resolution x-ray 
crystal structure for apo A-I crystallized without lipid (Borhani et al., 1997) has been obtained. 
The low resolution crystal structure of an N-terminally truncated apo A-I shows a unit cell 
containing a tetrameric species composed of 4 helical rods which bend into a horseshoe shape 

25 and which combine to give a circular aggregate about 125 x 80 x 40 A. It was suggested that a 
dimeric species in this belt conformation is capable of forming discoidal particles. 

The information collected to date concerning the reverse cholesterol transport cycle and 
the maturation of HDL particles suggests that the apo A-I protein has unique properties that allow 
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it to interact spontaneously with membranes resulting in the formation of lipoprotein particles. 
Initial apo A-I lipid binding events have been proposed (Rogers et al, 1998), but conversion of 
bilayer-associated forms to discoidal particles remains unclear. The available evidence suggests 
that the energy of stabilization of lipid-free apo A-I is fairly low and that there is an equilibrium 
between two conformers (Atkinson and Small, 1986; Rogers et al., 1998). One conformer may 
be a long helical rod, and the other may be a helical "hairpin" structure about half as long. It has 
been suggested that the low stabilization energy and conformational plasticity allow apo A-I to 
structurally reorganize when it encounters a lipid membrane, thus facilitating the structural 
changes that would have to take place in both the membrane and the protein to produce discreet 
lipoprotein particles (Rogers et al., 1998). Once these particles are formed, apo A-I may still 
undergo specific conformational changes that contribute to the dynamic functionality of the 
lipoprotein particles. All of these interactions and changes take place at the protein-lipid 
interface. Thus, there is little reason to believe that apo A-I itself would be ideal for generating 
a stable, nanometer size phospholipid bilayer. 

Synthetic rHDL form spontaneously upon interaction of apo A-I with phosphatidylcholine 
liposomes at certain protein-lipid ratios and temperatures at or above the phase-transition 
temperature of the lipid (Jonas, 1986). The method of detergent dialysis of mixtures of apo A-I 
and phospholipid is also used to form particulate structures and affords a method of incorp orating 
purified membrane proteins. The sizes of discoidal particles formed depend on the protein to 
lipid ratio of the formation mixture and reflect the diameter of the bilayer domain (Brouillette et 
al., 1984; Wald et al., 1990). Size classes therefore arise from the number of associated apo A-I 
molecules at the perimeter of the phospholipid disk. These classes have been termed LP 1 , LP2, 
LP3, and LP4 for the stoichiometry of apo A-I protein molecules per disk. Variable sizes within 
the LP classes also arise due to heterogeneity in the conformation of apo A-I. One aspect of the 
present invention is based on the ability to identify the structure responsible for this heterogeneity 
and eliminate it to produce a monodisperse population of disk structures. Currently, the 
formation of homogeneous particles larger than 10 nm diameter requires separation of the 
particles from a mixture of species containing from 2 to 4 associated apo A-I molecules, while 
10 nm diameter particles are the major form at low apo A-I to phospholipid ratios during 
formation. The purity of single size classes and the ability to obtain high efficiencies of 
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membrane protein or membrane fragment incorporation requires alteration of the apo A-I 
structure. 

Different types of lipid aggregates are used for reconstitution and study of purified 
membrane proteins; these include membrane dispersions, detergent micelles and liposomes. See 

5 Fig. 1 . Purified systems for biochemical and physical study require stability, which is not always 
inherent in some systems. Liposomes are closed spherical bilayer shells containing an aqueous 
interior. Reconstitution into liposomes by detergent dialysis or other methods typically results 
in random orientation of the protein with respect to outer and lumenal spaces. Since ligands or 
protein targets are usually hy drophilic or charged, they cannot pass through the liposomal bilayer 

10 as depicted in Fig. 1 , although this may be advantageous in some instances. Since both sides of 
the liposomal bilayer are not accessible to bulk solvent, coupling effects between domains on 
opposite sides of the bilayer cannot be studied. Liposomes are also prone to aggregation and 
fusion and are usually unstable for periods of more than about a week or under certain physical 
manipulations, such as stopped flow or vigorous mixing. The size of liposomes obtained by 

15 extruding through defined cylindrical pore sizes polydisperse in size distribution rather than 
exhibiting a uniform diameter. 

Liposomes also may present difficulties due to light scattering, and aggregation of 
membrane proteins present in the bilayer. The surface area of a liposome is relatively large (1 0 5 
to 1 0 8 A2). To obtain liposomes with single membrane proteins requires a large lipid io protein 

20 molar ratio. Detergent micelles allow solubilization of membrane proteins by interaction with 
the membrane-embedded portion of the protein and are easy to use. Detergent micelles are 
dynamic and undergo structural fluctuations that promoter subunit dissociation and often present 
difficulty in the ability to handle proteins in dilute solutions. An excess of detergent micelles, 
however, is necessary to maintain the protein in a non-aggregated and soluble state. Detergents 

25 can also be mildly denaturing and often do not maintain the properties found in a phospholipid 
bilayer system. Specific phospholipid species are often necessary to support and modulate 
protein structure and function (Tocanne et al., 1994). Thus, the structure, function, and stability 
of detergent solubilized membrane proteins may be called into question. Since an excess of 
detergent micelles is needed, protein complexes can dissociate depending on protein 
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concentration and the detergent protein ratio. By contrast, the MSP nanostructures of the present 
invention are more robust structurally, having a phospholipid bilay er mimetic domain of discrete 
size and composition and greater stability and smaller surface area than unilamellar liposomes. 
The disk structures allow access to both sides of the bilayer like detergents, and also provide a 
5 bilayer structure like that of liposomes. 

There is a long felt need in the art for stable, defined compositions for the dispersion of 
membrane proteins and other hydrophobic or partially hydrophobic proteins, such that the native 
activities and properties of those proteins are preserved. 

SUMMARY OF THE INVENTION 

10 Membrane Scaffold Proteins (MSPs) as used herein are artificial proteins which self 

assemble with phospholipids and phospholipid mixtures, or in the absence of phospholipids, into 
nanometer size membrane bilayers. A subset of these nanometer size assemblies are discoidal 
in shape, and are referred to as nanodisc or nanodisk structures. These "nanoscale" particles can 
be from about 5 to about 500 nm, about 5 to about lOOnm or about 5 to about 50 nm in diameter. 

15 These structures comprising phospholipid and MSP preserve the overall bilayer structure of 
normal membranes but provide a system which is both soluble in solution and which can be 
assembled or affixed to a variety of surfaces. The amino acid sequences of specifically 
exemplified MSPs are given in SEQ ID NO:6, SEQ ID NO:9, SEQ ID NO: 17, SEQ ID NO: 19, 
SEQ ID NO:23, SEQ ID NO:29, SEQ ID NO:43, SEQ ID NO:44 and SEQ ID NO:45. 

20 The present invention further provides the use of the nanometer scale phospholipid 

bilayer structures or nanodiscs formed using the MSPs of the present invention are useful for the 
incorporation of additional hydrophobic or partially hydrophobic protein molecules. Those 
additional proteins can be solubilized, for example, with the use of detergent, and the solubilized 
proteins can be added to a solution of MSP, with or without phospholipid(s), and the nanoscale 

25 particles self assemble so that the MSPs and the additional proteins are incorporated into a stable 
and soluble particle. Subsequently, any detergent can be removed by dialysis. Those proteins, 
found in nature in the various membrane structures of a living organism, are solubilized in the 
MSP supported nanobilayer or nanodisc, and the native structure and activity of the target protein 
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are preserved in these MSP-supported structures. Besides hydrophobic or partially hydrophobic 
proteins, membrane fragments or disrupted membranes can be assembled with the MSPs of the 
present invention. 

The MSP supported bilayers or nanodiscs can be used in solutions or applied to a number 
5 of surfaces, such that the native structure and ligand binding of the natural protein incorporated 
in the MSP supported structure are maintained. As specifically exemplified, the MSP supported 
structures are affixed to a gold surface, e.g., for use in surface plasmon resonance technologies. 

The present invention relates to methods for the incorporation of integral membrane 
proteins into nanoscale lipid bilayers or nanodiscs comprising at.least one MSP of the present 

10 invention. Three classes of membrane proteins (tethered, embedded, and integral) can be used 
in the methods of the present invention. The first membrane protein class is the tethered; this 
class is exemplified by NADPH-cytochrome P450 reductase and human tissue factor. Examples 
of embedded membrane proteins include, without limitation, cytochrome P450 2B4 from rabbit 
liver microsomes, cytochrome P450 3 A4 from human liver microsomes and cytochrome P450 

15 6B1 from insect microsomes. The integral membrane proteins are exemplified by the 7-helix 
transmembrane proteins, including, but not limited to, bacteriorhodopsin from Halobacterium 
halobium, the 5-hydroxy tryptamine 1 A G-protein coupled receptor from Homo sapiens and other 
G-protein coupled protein receptors. Members of each class of membrane protein have been 
successfully incorporated into the nanoscale structures using the MSPs and methods of the 

20 present invention. In particular, cell surface receptors, and especially G-protein coupled 
receptors, can be incorporated into nanobilayer bilayer structures formed from a class of 
membrane scaffold proteins (MSPs). 

We developed nanodiscs for use in structural, biochemical and pharmaceutical techniques 
by engineering the scaffold protein (MSP) for greater stability, size homogeneity and useful 
25 functionalities in the resultant nanoscale lipoprotein particle. These particles can form tags for 
purification and physical manipulation of disks such as in hydrogels or on a gold biosensor 
surface, and they can serve as entities for rapid and reproducible assay and crystallization. The 
nanoparticles and membrane protein scaffolds are useful in biotechnology, the pharmaceutical 
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industry as well as in basic research. In addition, the structural and functional principles 
uncovered through our discovery and the related techniques facilitate understanding the 
interactions of proteins with lipid bilayers at the molecular level. 

Another aspect of this invention relates to a class of membrane scaffold protein (MSP) 
5 that can be used to solubilize membrane proteins and complexes in a functionally stable 
monodisperse phospholipid-bilayer associated form. Furthermore, this MSP provides means of 
physical manipulation of single membrane proteins. The MSP is modeled after human 
apolipoprotein A-I, which under certain conditions, can self-assemble with phospholipid to form 
discoidal structures having diameters of 100 to 200 A (Brouillette et aL, 1984). Other 
10 amphipathic proteins could also have served as a starting point. Although apoprotein A-I was 
known, it has not been used in any general method to solubilize and study tethered, embedded 
or integral membrane proteins of completely unrelated origin. A specific embodiment of this 
invention is the use of lipid-associated MSPs for the solubilization, manipulation and study of 
membrane proteins. 

15 The present invention further provides materials and methods using genetically 

engineered MSPs to minimize the size of the MSP with respect to membrane particle formation 
and to increase the stability and monodispersity of the self-assembled nanoparticle by altering 
the sequence of the parent molecule. In particular, we have developed MSPs for the study of 
G-protein coupled receptors. G-protein coupled receptors (GPCRs) are an important and diverse 

20 class of pharmaceutical targets in mammalian cellular membranes where they function as signal 
transducing elements, bind several classes of bioactive ligands and transmit information to the 
intracellular machinery. The artificial MSP of the present invention stabilizes and solubilizes the 
membrane-associated form of GPCR to allow purification and manipulation in solution or on a 
solid support for use in high throughput screening applications and on surfaces for 

25 surface-plasmon biosensor and scanning-probe techniques. The artificial MSP of the present 
invention can be used to facilitate expression and purification of recombinant membrane proteins 
in soluble form. 
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Also within the scope of the present invention are recombinant DNA molecules which 
encode MSPs and host cells containing those recombinant DNA molecules which are used to 
produce the MSPs. MSPs encoded by these recombinant DNA molecules include those 
comprising amino acid sequences including, but not limited to, SEQ ID NO:6, SEQ ID NO:9, 
SEQ ID NO:17, SEQ ID NO:19, SEQ ID NO:23, SEQ ID NO:29, SEQ ID NO:43, SEQ ID 
NO:44andSEQIDNO:45. 

BRIEF DESCRIPTION OF THE DRAWINGS 
Figure 1 schematically illustrates different types of lipid aggregates incorporating a 
membrane protein. Small circles and triangles represent ligand for intracellular and extracellular 
domains of the receptor proteins, respectively. 

Figure 2 shows the wheel structure of an alpha helix, with the placement of hydrophobic 
and hydrophilic amino acid side chains that give the helix its amphipathic character. 

Figure 3 A is a schematic representation of a "picket fence" model for MSP stabilization 
of a bilayer. The circles represent single a helices with a diameter of about 1 .5 nm and 0.54 nm 
per turn of the helix (3.6 amino acid residues per turn). Figure 3B is a schematic of a "belf 5 
model of an MSP supported bilayer. The rectangles represent single a helices with a diameter of 
about 1 .5 nm and a helix length of about 3.9 nm. 

Figure 4 presents the sequence and secondary structure prediction for the parent apo A-I 
molecule. Stars denote helical repeat boundaries. Underlined sequence in bold represents 
a-helical structure, italics represent b structure and plain text represents potential turns. 

Figures 5A-5G illustrate various engineered MSP structures, shown with picket fence 
topology and helical assignments based on sequence analysis. Fig. 5A: MSP1 showing positions 
of half-repeats. Half-repeat 1 is disordered based on molecular dynamics simulation (Phillips, 
1997). Fig. 5B: Hinge domain movement. Fig.SC: Removal of half-repeats. Fig. 5D: Hinge 
domain replacement with helices 3 and 4. Fig. 5E: MSP2, with a tandem duplication of the 
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sequence of MSP1. Fig. 5F: Removal of half-repeat 1 to make MSP1A1. Fig. 5G: Tandem 
repeat of MSP1 Al to form MSP2A1. 

Figures 6A-6B diagrammatically illustrate the PCR strategy used to amplify artificial 



Figures 7A-7B shows diagrams of MSP2 with (Fig. 7A) and without a long linker 
sequence (Fig. 7B). 

Figures 8 A-8B illustrate the strategy for constructing and expressing an artificial sequence 
encoding an MSP1 derivative lacking helices 4 and 5. 

Figures 9 A-9B illustrate the strategy for constructing and expressing an artificial sequence 
encoding an MSP1 derivative lacking helices 5 and 6. 

Figure 10 provides a gel filtration elution profile of bacteriorhodopsin reconstituted into 
MSP1 structures. 

Figure 1 1 show a gel filtration chromatogram of bacteriorhodopsin solubilized by MSP 1 
in the absence of added phospholipid. Bacteriorhodopsin is detected by absorbance at 550 nm, 
while MSP1 and bacteriorhodopsin protein is detected by absorbance at 280 nm. 

Figure 12 is a gel filtration chromatogram of bacteriorhodopsin solubilized by MSP2 in 
the absence of added phospholipid. Bacteriorhodopsin is detected by absorbance at 550 nm, 
while MSP2 and bacteriorhodopsin protein is detected by absorbance at 280 nm. 

Figure 13 illustrates nanoscale particle formation with varying lipid to MSP ratios. 
Particles were formed at the indicated lipid to MSP mole ratio and separated by native gradient 
gel electrophoresis. As indicated at the right, MSP1 forms 8.2, 9.6 and 10.6 nm diameter 
particles. MSP2 forms predominantly 9.6 nm particles. 



MSPs. 
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Figure 14 shows the results of chemical denaturation of MSP 1 and MSP2 particles (9 nm 
diameter) in complexes with dipalmitoyl phosphatidylcholine as monitored by tryptophan 
fluorescence. Excitation was at 280 nm, and the buffer was 1 0 mM Hepes pH 7.5; 0. 1 5 M NaCl. 

Figures 15A-15B show the membrane proteins incorporated into disks and attached to 
solid supports. Fig. 15 A: Disk-associated receptor and ligand-induced assembly of 
receptor-target complex on gold. Fig. 15B: Disk-associated receptor in gel matrix. 

Figure 16A shows the results of HPLC over a sizing column of nanodisc particles with 
retention times indicating 8 and lOnm sizes and Tissue Factor activity at25-28min. Figure 16B 
shows the results of SDS-PAGE using 8-25% gradient gels; the MSP 1 -supported nanodisc 
bilayers isolated from the HPLC and containing TF have the expected molecular weights. 

Figures 1 7A-1 7B illustrate thee separation of cytochrome b5 into discs as determined by 
native PAGE using 8-25% gradient gels. Lane 1 , molecular weight markers; lane 2, cytochrome 
b5; lane 3, disks; lane 4, disk/cytochrome b5; lanes 5 and 6, anion exchange purified cytochrome 
b5 in disks representing two pools eluted from the anion exchange column. Fig. 17A shows 
protein stained with Coomassie blue, and Fig. 1 7B shows heme-specific staining. Note that the 
232 kDa marker protein (catalase) also stained for heme. 

Figure 1 8 is a chromatogram of cytochrome P450 3 A4 incorporated into 10 nm bilayer 
discs consisting of 85% DPPC, 15% POPC. 

Figure 19 is a chromatogram of cytochrome P450 3A4 incorporated into 10 nm bilayer 
discs consisting of 85% DPPC, 15% POPS. 

Figure 20 is a chromatogram of cytochrome P450 3A4 incorporated into 10 nm bilayer 
discs consisting of 100% DPPC. 

Figure 20 shows the results of PAGE over an 8-25% gradient gel with three disc samples 
which correlate with the sizes of the nanodisc particles. 
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Figure 22 illustrates the results of HPLC chromatography of MSP-solubilized cytochrome 
P450 6B1 over a Superdex 200 sizing column. Retention times indicate nanodisc particles 
containing 6B1 (420 nm trace), about 1 The flow rate was 0.25 ml/min. 

Figure 24 illustrates the results of PAGE with sample 1 (nanodiscs prepared with 
5 microsomal membranes from cells coexpressing cytochrome P450 6B1 and NADPH P450 
reductase. Sample 2 contains control microsomes. 



Figure 24 provides a [CO-Fe +2 - CO-Fe 3 *] optical difference spectrum. Active 
cytochrome P450 6B1 incorporated into nanodiscs absorbs at 450 nm. 

Figure 25 depicts a chromatogram of sample separated by a Superdex sizing column. 
1 0 Retention times indicated rHDL particles 1 0 nm in size. 

Figure 26 illustrates co-incorporation of cytochrome P450 reductase and cytochrome 
P450 6B1 in MSP nanodiscs. The ratio of absorbances at 456 nm (predominantly reductase) to 
that at 420 nm (predominantly P450) is plotted as a function of retention time. The peak at about 
26 min indicates a nanodisc population containing both reductase and cytochrome. 

1 5 Figure 27 illustrates the binding of DPPC nanodiscs containing carboxy 1 terminated thiols 

to a gold surface, as monitored by surface plasmon resonance. 



DETAILED DESCRIPTION OF THE INVENTION 
Abbreviations used in this application include A, Ala, Alanine; M, Met, Methionine; C, 
Cys, Cysteine; N, Asn, Asparagine; D, Asp, Aspartic Acid; P, Pro, Proline; E, Glu, Glutamic 
20 Acid; Q, Gin, Glutamine; F, Phe, Phenylalanine; R, Arg, Arginine; G, Gly, Glycine; S, Ser, 
Serine; H, His, Histidine; T, Thr, Threonine; I, He, Isoleucine; V, Val, Valine; K, Lys, Lysine; 
W, Try , Tryptophan; L, Leu, Leucine; Y, Tyr, Tyrosine; MSP, membrane scaffold protein; DPPC, 
dipalmitoyl phosphatidylcholine; PC, phosphatidylcholine; PS, phosphatidyl serine; BR, 
bacteriorhodopsin. 
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The simplest single-celled organisms are composed of central regions filled with an 
aqueous material and a variety of soluble small molecules and macromolecules. Enclosing this 
region is a membrane which is composed of phospholipids arranged in a bilayer structure. In 
more complex living cells, there are internal compartments and structures that are also enclosed 

5 by membranes. There are many protein molecules embedded or associated within these 
membrane structures, and these so-called membrane proteins are often the most important to 
determining cell functions including communication and processing of information and energy. 
The largest problem in studying membrane proteins is that the inside of the phospholipid bilayer 
is hydrophobic and the embedded or anchored part of the membrane protein is itself also 

1 0 hydrophobic. In isolating these membrane proteins from their native membrane environments, 
it is very difficult to prevent them from forming inactive aggregates. The present invention 
provides ways to generate a soluble nanoparticle that is itself a native-like phospholipid bilayer 
into which hydrophobic proteins of interest (target proteins) can be incorporated to maintain the 
target protein as a soluble and monodisperse entity. This is accomplished by incorporating 

15 hydrophobic proteins such as membrane proteins into nanometer scale structures. 

Membrane Scaffold Proteins (MSPs) as used herein are artificial proteins which self 
assemble phospholipids and phospholipid mixtures into nanometer size membrane bilayers. A 
subset of these nanometer size assemblies are discoidal in shape, and are referred to as nanodisc 
or nanodisc structures. These structures preserve the overall bilayer structure of normal 
20 membranes but provide a system which is both soluble in solution and which can be assembled 
of affixed to a variety of surfaces. 

The MSPs of the present invention must be amphipathic, with one part of its structure 
more or less hydrophilic and facing the aqueous solvent and the other part more or less 
hydrophobic and facing the center of the hydrophobic bilayer that is to be stabilized. 
25 Examination of the basic biochemical literature reveals two candidates of protein structures that 
have this required amphipathic character: the alpha-helix and the beta-sheet. Although there 
are examples in the literature where beta-sheets can fold upon themselves to create a structure 
where the "inside" is hydrophobic and the "outside" is hydrophilic, the central cavity so formed 
in the simplest of these structures is small. Such a small internal region might stabilize a 
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phospholipid bilayer, but the size would be too small to incorporate any desired membrane 
protein target. Hence, we designed the MSPs of the present invention to have an alpha helix as 
a fundamental amphipathic building block. Each MSP has an amino acid sequence which forms 
amphipathic a-helices with more hydrophobic residues (such as A, C, F, G, I, L, M, V, W or Y) 
predominantly on one face of the helix and more polar or charged residues (such as D, E, N, Q, 
S, T, H, K, or R) on the other face of the helix. See Fig. 2 for a schematic representation. In 
addition, the helicai structure is punctuated with proline residues (P) about every 20-25 amino 
acids to form "kinks" or to initiate turns to facilitate the "wrapping" of the MSP around the edge 
of a discoidal phospholipid bilayer. See Fig. 2, which depicts a generalized linear amino acid 
sequence and a helical wheel diagram showing the placement of predominantly hydrophobic 
amino acids on one face of the helix. The exact amino acid sequence can vary in the positioning 
and number of the hydrophobic amino acids within the designed linear sequence. Simple models 
in which either the helical axis is parallel or perpendicular to the normal of the Nanodisc bilayer 
can be generated; See Figs. 3 A and 3B. To generate a disk with diameter of roughly 10 nm, an 



MSP comprises about 12 to about 20 or more repeating units having this generalized amphipathic 
sequence. Preferably, this protein would be composed of amphipathic alpha helices each with 
a length of between 14 and 25 amino acids, punctuated in the linear sequence by a residue 
unfavorable for helix formation, such as proline, which form small helical building blocks that 
stabilize the hydrophobic core of the phospholipid bilayer. These small helical segments are 
linked together with about 1 to about 5 amino acid residues. To cover the edge of a 10 nm 
discoidal particle in either the "belt" or "picket fence" models presented, one would need 
between 10-20 such helices, with 16 being the optimal number based on the simple graphic 
analysis of Figs. 3A and 3B. We thus built a synthetic gene to express a protein containing the 
desired amphipathic helices. 

The MSPs of the present invention can be used to solubilize tethered, embedded or 
integral membrane proteins in nanoscale structures. Tethered membrane proteins are composed 
mostly of a relatively soluble globular domain external to the bilayer and relatively simple (e.g., 
a single pass helix) which anchors this domain to the membrane bilayer. The globular domain, 
in nature, can be extracellular or cytoplasmic in orientation. Embedded membrane proteins, as 
defined herein, are those which include a membrane anchoring segment of the polypeptide, but 
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which also have groupings of hydrophobic amino acids on the surface of the protein, which 
hydrophobic domains are embedded within the membrane bilayer. Integral membrane proteins 
are predominantly located within the membrane bilayer; relatively small portions of the protein 
are exposed to an aqueous environment within the cell or to the extracellular aqueous 
5 environment. 

The tethered membrane protein class is exemplified by NADPH-cytochrome P450 
reductase (e.g., from rat liver endoplasmic reticulum), cytochrome b5 and human tissue factor. 
NADPH-Cytochrome P450 reductase is a membrane protein found in the endoplasmic reticulum 
and catalyzes pyridine nucleotide dehydration and electron transfer to membrane bound 

1 0 cytochrome P450s. Isozymes of similar structure are found in humans, plants, other mammals, 
insects etc. Tissue factor (TF), or thromboplastin, is a 30,000 Da type-I tethered membrane 
protein critical to initiation of the blood coagulation cascade. This membrane-bound protein acts 
as an activation cofactor for factor VII, the soluble serine protease which carries out the first 
enzymatic step in blood coagulation. Expression of tissue factor is limited to cells that are not 

1 5 in direct contact with blood plasma, forming a "hemostatic envelope" around vasculature and the 
entire organism. High levels of TF are found in the skin, brain, and the adventital layer which 
surrounds blood vessels. The TF:VII complex must be assembled on a membrane surface to 
exhibit high activity, and optimal activity is seen only when the membrane contains 1 
phospholipids with negatively charged headgroups. Cytochrome b5 is a membrane-anchored 

20 (tethered) heme protein having a single membrane anchor domain that penetrates the membrane 
bilayer. Cytochrome b5 solubilized from its native membrane exists as large aggregates in the 
absence of detergent and appears as a smear rather than a discrete band on native polyacrylamide 
gel electrophoresis (PAGE) (Fig. 17, lane 2). Formation of Nanodiscs through the self-assembly 
process using MSPs taught in our invention, wherein cytochrome b5 is added to the preparation 

25 of MSP and phospholipid results in incorporation of cytochrome b5 into disk-sized structures 
(lane 4 of Fig. 17). This is verified by the intense heme-staining of the band corresponding to 
Nanodiscs in the right panel. Cytochrome b5-containing Nanodiscs separated by anion-exchange 
chromatography are shown in lanes 5 and 6 of Fig. 17. Two peaks elute from the anion exchange 
column near 310 mM NaCl and near 370 mM NaCl. Disks alone were observed to elute near 3 1 0 

30 mM NaCl and cytochrome b5 alone to elute between 450 and 700 mM NaCl. This data 
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demonstrates that cytochrome b5 is solubilized using the MSP technology and that disk 
complexes containing cytochrome b5 can be chromatographically separated and purified from 
undesired aggregated material. The optical absorption properties of the heme chromophore of 
the purified material show that the heme active site is in a native conformation. 



5 Examples of embedded membrane proteins include, without limitation, cytochrome P450 

2B4 from rabbit liver microsomes, cytochrome P450 3A4 from human liver microsomes and 
cytochrome P450 6B1 from insect microsomes. The cytochromes P450 are a superfamily of 
enzymes that are found in all forms of life. One role of many mammalian P450s is to detoxify 
xenobiotics; for instance, human liver P450s detoxify most endogenous and exogenous 

1 0 compounds, and these enzymes determine the mean plasma lifetime of all drugs ingested. One 
of the most widely studied human liver cytochrome P450s is cytochrome P450 3 A4 (CYP 3 A4). 
This membrane bound P450 is the most highly expressed P450 in human liver and is responsible 
for metabolizing almost 50% of all pharmaceuticals (Guengerich, F.P., Cytochrome P450. 
Cytochrome P450,ed.P.R. Ortiz deMontellano. 1995, New York: Plenum Press. 473-535.) In 

1 5 order to demonstrate the utility of Nanodisc technology for the study of the cytochromes P450, 
we incorporated CYP 3A4 into MSP supported nanobilayer discs. Figs. 18-21 show that the 
retention times of the CYP 3A4 (observed by optical absorbance at 417 nm) and the nanodiscs 
(monitored at 280 nm where both MSP and P450 absorb) elute from the column at the same time, 
approximately 24 minutes. This elution time also correlates closely to the calculated retention 

20 time of the disc protein complex. Further evidence that supports this is a native poly acrylamide 
gel electrophoresis (PAGE) that directly measures the size of eluted Nanodisc particles (Fig. 21). 

Cytochrome P450 6B1 (CYP 6B1) is a member of the large cytochrome P450 
monooxygenase protein superfamily, and it is another example of an embedded membrane 
protein. CYP 6B 1 has been isolated from Papilio polyxenes, the black swallow tail, which feeds 
25 exclusively on plants producing furanocoumarins, plant metabolites that are phototoxic to most 
organisms. CYP 6B1 catalyzes the detoxification of furanocoumarins by what is believed to be 
an epoxidation reaction (Ma, R. et al. (1994) Arch Biochem. Biophys. 310(2), 332-40). In order 
to show a new application of the MSP technology of the present invention, we demonstrates that 
isolated membranes containing their repertoire of membrane proteins can be incorporated into 
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nanodisks comprising MSPs. A particularly important embodiment is the use of the common 
insect cell culture- baculovirus expression system which is used widely as a heterologous 
expression system. Thus, we used a commercially available Sf9 insect cell line co-infected such 
that a microsomal preparation containing overexpressed insect CYP 6B 1 and an over-expressed 
insect NADPH cytochrome P450 reductase was produced Hence, we not only demonstrated that 
MSP nanodiscs can be used to incorporate another cytochrome P450 system into soluble 
monodisperse particles but also that the source of this P450 could be simply the whole 
membranes from the Sf9 cell line that has been infected with a cloned CYP 6B1 gene. Thus, 
MSP supported nanodiscs can be produced for use in high-throughput screening ventures such 
as the identification of ligands for membrane-associated proteins and for the identification of new 
pharmaceuticals. This application can be extended to any other source of membrane fragments 
containing target proteins of interest, such as any mammalian cell culture system or mammalian 
expression system. 

An important utility of the nanodisc technology of the present invention is in high 
throughput screening for enzymatic or receptor binding activity. In many such systems, it is 
advantageous to have more than one membrane protein target incorporated into the nanodiscs, 
for example, of the electron transfer partner needed for P450 monooxygenase catalysis or the 
corresponding G-protein incorporated with a G-protein coupled receptor. In order to demonstrate 
the utility of the MSP nanodisc technology in these situations, we successfully incorporated the 
NADPH cytochrome P450 reductase and a cytochrome P450 6B 1 . As demonstrated herein, each 
target membrane protein can be individually incorporated into nanodiscs using MSPs or they can 
be incorporated in combinations. The endogenous relative amounts of cytochrome P450 to 
reductase is about 1 0-20 P450 molecules per reductase molecule (Feyereisen, R. (1 999). Annual 
Review of Entomology 44, 501-533). To obtain activity of CYP6B1 after reconstitution into 
disks, it is preferable that an excess amount of reductase be added to the reconstitution mixture, 
such that a P450 molecule and reductase molecule both partition into a single disk. 
Supplementation of the microsomal preparation with exogenously added reductase has been 
accomplished. The sample was separated by a Superdex sizing column, where absorbance at 280 
nm indicates presence of MSP 1, absorbance at 420 nm and 456 nm indicates presence of ferric 
species, and absorbance at 456 nm also indicates presence of reductase (Fig. 25). A ratio plot of 
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456 nm to 420 nm was made which showed positions on the chromatogram where the absorbance 
at 456 nm was above that associated with CYP6B1 and, therefore, could be attributed to 
absorbance by reductase (Fig. 26). Retention times correlated with the presence of 1 0 nanometer 
particles containing both CYP6B1 and reductase. See also Figs. 22-24. 

The integral membrane proteins are exemplified by the 7-helix transmembrane proteins, 
including, but not limited to, bacteriorhodopsin from Halobacterium halobium, the 5-hydroxy 
tryptamine 1A G-protein coupled receptor from Homo sapiens and other G-protein coupled 
protein receptors. Members of each class of membrane protein have been successfully 
incorporated into the nanoscale structures using the MSPs and methods of the present invention. 
In particular, cell surface receptors, and especially G-protein coupled receptors, can be 
incorporated into nanobilayer bilayer structures formed from a class of membrane scaffold 
proteins (MSPs). BR has been incorporated into the MSP nanodiscs as described herein below, 
and we also used a commercially available insect cell expression system that provides a 
membrane fraction hosting the G-protein coupled receptor human for 5-HT-l A (serotonin). The 
ligand binding activity documented for 5-HT-l A incorporation into nanodiscs proves that the 
protein is in the active conformation in the nanodiscs of the present invention. 

Fig. 10, which shows a gel filtration elution profile of the MSP/BR/dimyristoyl 
phosphatidylcholine synthesis mixture, demonstrates that bacteriorhodopsin has been solubilized. 
In the absence of MSP, bacteriorhodopsin elutes quantitatively in the void fractions. A major 
peak of bacteriorhodopsin elutes at a position slightly earlier than the majority of MSP disks 
(small complexes -100 A in diameter, based on Stake's radii of calibration standards). Larger 
complexes containing BR are also formed. The BR present in the complexes can be specifically 
bound to and eluted from a nickel affinity column through an engineered MSP that contains a 
6His tag. In addition, the elution profile of BR in gel filtration is remarkably similar to elution 
profiles of MSP disks in the absence of BR The spectrum of BR in the complexes resembles 
that of detergent-solubilized monomeric BR and appears unchanged for several weeks upon 
storage at 4°C. Surprisingly, MSP quantitatively solubilizes BR in the absence of exogenously 
added phospholipid. These lipid-poor complexes are also the size of small MSP/phospholipid 
disks. 
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We created an artificial MSP (MSP2) by designing a tandem repeat of MSP 1 connected 
by a short linker to create a new molecule. See Fig. 5G and SEQ ID NO: 17. Relatively large 
quantities (tens of milligrams/liter cell culture) of the artificial MSPs of the present invention are 
produced in a bacterial expression system. Our constructs reduce the number of size classes that 
can be formed (those corresponding to three MSP 1 molecules). Preliminary evidence indicates 
that the sizes of major species formed with MSP2 correspond to two and four MSP1 molecules. 
In addition, the smallest disk sizes due to alternate conformations of membrane scaffold proteins 
that are found in preparations with MSP1 at lowphospholipid:MSP ratios do not exist for MSP2. 
Without wishing to be bound by theory, we believe that the smaller particles contain a single 
molecule of MSP2 and the larger disks contain two molecules. 

The scaffold protein (MSP) has been engineered to minimize the variability in the 
structure of the discoidal phospholipid bilayer entities, provide greater structural stability and 
increased size homogeneity of the disk structures, and to incorporate useful functionalities such 
as tags for purification, and physical manipulation of disks. Disk homogeneity is necessary for 
efficient incorporation of single membrane proteins or single membrane protein complexes into 
a single size class of disk. The parent molecule, apo A-I, has functions beyond disk structure 
stabilization; these include cellular receptor binding, LCAT activation and structural 
transformations between various lipoprotein species (Forte et al., 1971; Holvoet et al., 1995; 
Fidge, 1999). These functional regions are unnecessary and often deleterious in the artificial 
bilayer systems of the present invention. The artificial scaffold protein can be used in studies of 
amphipathic helix membrane-protein structures. 

Secondary structure prediction provides a way of assessing structural features of the 
scaffold protein. The structure consists of mostly a-helix punctuated by prolines in the repeat 
sequences as shown in Figure 4. Eight to nine alpha helices are believed to associate with lipid 
in the form of disks. The N-terminal region of apo A-I is predicted to be more globular in 
character. This portion of the molecule has been removed to produce a construct that is capable 
of forming disks. An MSP that produces disk assemblies with highmonodispersity is desirable. 
The central helices (99-186) can be displaced by the lipid-fiee form of a related protein, 
apolipoprotein A-II, added to a solution of disk structures (Durbin and Jonas, 1999), and that 
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these helices can be part of a "hinge" domain (Figs. 5A-5B) that is dissociated from the edge of 
the disk, producing various particle diameters within the LP2, LP3 and LP4 classes (Jonas et al., 
1989). Disk forms with dissociated hinged regions are also more susceptible to proteolysis 
(Dalton and Swaney, 1993), which is undesirable. The deletion of pairs of the central helices 
(100-143, 122-165, and 144-186) results in recombinants that form disks of smaller size than 
full-length apo A-I, and in addition two of the mutants (122-165 and 144-186) have increased 
stability to chemical denaturation (Frank et al., 1997). Further work replaced helices 5 and 6 
(143-186) with another set of helices (Fig. 5D). Helices 3 and 4 contain regions that were 
thought to confer stability to disks (Frank et al., 1997). Helix 3 has a high lipid affinity and is 
believed to confer stability to the lipid-associated form through salt bridges. Helix 1 also has a 
high lipid affinity and is completely helical (Rogers et al., 1 998). A construct which incorporates 
the helix 1-2 pair in place of helix pair 5-6 is desirable. The roles of the half-repeats are of great 
interest. These 1 1-mer repeats are predicted to be a-helical, but are not long enough to span the 
bilayer in the picket fence model. In a molecular dynamics model of the disk in the picket fence 
model, the region corresponding to half-repeat 1 was in fact "floppy" and did not interact with 
lipid, while half-repeat 2 was found parallel to the bilayer near the headgroup region (Figs. 5A 
andSC). Such structures may confer disorder to the resulting disks. To ascertain the roles of half 
repeats and to further optimize the MSP structure and function, mutagenesis and directed 
evolution were used to generate variants that are described herein below. 

Receptors incorporated into MSP disks are useful in structural, biochemical and 
pharmaceutical research. Membrane protein study is currently limited to insoluble membrane 
dispersions, detergent micelles, and liposomes. Purified systems for biochemical and physical 
study require stability, which may or may not obtainable with detergents in many instances. 
Detergent micelles are dynamic and undergo structural fluctuations that promote subunit 
dissociation and present difficulty in the ability to handle proteins in dilute solution. The MSP 
nanobilayers (nanodiscs) are more robust structurally, having a phospholipid bilayer mimetic 
domain of discrete size and composition, and greater stability and smaller surface area than 
unilamellar liposomes. The particles of the present invention are stable in size and structure for 
at least a month at 4° C. 
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Signal transducing elements occur across membranes, while vesicles render one side of 
membrane in accessible to hydrophilic reagents and effector proteins. A specific embodiment 
of the present invention uses disks to stabilize pharmaceutical targets such as GPCRs, ion 
channels, receptor kinases, and phosphatases in a membrane-bound form on carrier particles. We 

5 have incorporated proteins with multiple spanning helices into the disks of the present invention, 
with a focus on GPCRs. We have successfully incorporated a model serpentine receptor, 
bacteriorhodopsin. Bacteriorhodopsin is a model for GPCRs, which are current targets for drug 
discovery. Currently, over 1 000 probable G-protein receptors from various organisms have been 
cloned and many of the so-called "orphan" receptors await identification of natural ligands. 

1 0 Ligand classes include peptide hormones, eicosanoids, lipids, calcium, nucleotides, and biogenic 
amines. GPCRs are believed to account for more than half of currently marketed 
pharmaceuticals. One of ordinary skill in the art can, without undue experiment, optimize 
methods of incorporation of this structural class of membrane proteins. Structural 
characterization of the reconstituted receptors are performed using chemical analysis, 

1 5 spectroscopy and atomic force microscopy as described herein below. 

The MSPs of the present invention are used in disks to solubilize, stabilize, manipulate 
membrane proteins. The MSPs of the present invention, when formulated onto disks, are 
applicable in surface technology such as biosensor chip for high throughput screening or solid 
phase assay techniques. Our work on disk scaffolds has also involved surface-associated 

20 assemblies. For instance, the SPR biosensor utilizes an approximately 50 nm thick gold film on 
an optical component to couple surface plasmons to a dielectric component (sample) at the 
surface of the gold film. MSP stabilized bilayers can be attached to the surface for use as a 
biomimetic layer containing proteins or other targets of interest by engineering cysteines into the 
MSP (Fig. 15A). The use of thiols is well known for attaching molecules to gold surfaces. The 

25 placement of the cysteine depends on the model used for placement of the cysteine residue(s). 
Based on the belt model, cysteines can be placed along the polar side of the amphipathic helix 
axis, provided that a cysteine residue is not positioned at the helix-helix interface. The 
helix-helix interface of the belt is believed to be in register with the position of apo A-I Milano 
(R173C), which forms disulfide-linked dimers (Segrest et al., 1999). An alternative is to 

30 introduce cysteines within a flexible - or C-terminal linker. Such a construct is, in theory, 
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capable of associating either the belt or the picket fence form of disk to a gold surface. 
Alternatively, thiol lipids can be incorporated within the bilayer domain. In addition to SPR, 
surface-associated disks on gold can be used in STM and electrochemical studies, for example, 
such as with membrane associated redox proteins, e.g. cytochrome P450 and its flavoprotein, as 
well as ion channels. 

SPR data can also be obtained from measurements made using a thin film of dielectric 
such as silicon dioxide applied over the metal film normally used as the substrate in SPR. This 
variation of the technique has been termed coupled plasmon waveguide resonance (CPWR) 
(Salamon et al., 1997a). Because silica can be used as the active surface in these plasmon 
resonance experiments, the process of producing a self-assembled bilayer can be simplified to 
the same procedures used to produce surfaces on mica or other silicon oxide surfaces. This has 
the added advantage of making the conditions used for the SPR experiments directly comparable 
to those used for AFM experiments. The CPWR technique can easily be performed on our SPR 
instrument by simply adding the silica coating to the metal film slides that are presently used for 
SPR spectroscopy. 

MSPs with available cysteine groups also enable specific labeling with chemically 
reactive groups or affinity tags for immobilization in gel matrices. Hydrogels with reactive 
coupling groups are useful for immobilizing proteins for SPR measurements. In the hydrogel 
configuration (Fig. 15B), the disk would serve as a carrier for bilayer-embedded membrane 
proteins in a monodisperse form with both intra- and extracellular domains available for ligand 
binding. We have already demonstrated that disks containing His tag bind to a metal chelate 
matrix which can be used to immobilize BR. This points to another use of the disk structure, i.e., 
in preparing affinity matrices for bioseparation processes, and measuring ligand affinities. The 
particles and techniques with the present invention are useful for drug discovery, 
structure/function correlation, and structure determination of membrane proteins. 

Current limitations to structure determination of membrane proteins are the abilities to 
produce large amounts of membrane proteins, and to crystallize these proteins. MSPs are useful 
as carriers for membrane protein stabilization and expression. MSP can serve to solubilize 
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membrane proteins for crystallization in lieu of detergents. Indeed, where the lipid bound form 
of MSP is structurally stable and rigid, crystallization is enhanced by introduction of crystal 
contacts through the MSP. We have already demonstrated that MSP1 or MSP2 can be used to 
solubilize BR in the presence and absence of exogenous lipid. Additional nonexemplified fusion 

5 constructs with membrane protein with an MSP region can be expressed in Escherichia coli using 
any of a number of art-known vectors. In this way, a stable and soluble form of the membrane 
protein that contains a membrane anchor is produced in large quantity. The exciting discovery 
that MSP solubilizes BR in the absence of added phospholipid allows the use of the artificial 
MSP to stabilize membrane proteins in the absence of detergents or lipid additives. The artificial 

1 0 MSPs of the present invention can be used in solubilization of BR and other membrane proteins 
including, but not limited to, cytochrome P450, cytochrome P450 reductase, and the 5-HT-l A 
receptor. 

One important goal in utilizing a membrane scaffold protein (MSP) to provide membrane 
proteins in general, and G-protein Coupled Receptors (GPCRs) in particular, with a suitable 

15 environment for homogeneous biochemical assay or crystallization is to have a homogeneous 
preparation of particles. The membrane scaffold proteins we have described, ranging from full 
length human Apo-AI and its derivatives, to alternate membrane scaffold proteins including but 
not limited to, truncated human Apo-AI (t-MSP) where the amino terminal soluble domain has 
been removed, deletion mutants where one or more protein segments are removed and any of the 

20 above materials where a histidine tag is incorporated, primarily form 8-10 mn particles when 
self-assembled with phospholipids in solution. However, upon initial reconstitution, there are 
particles of other sizes present. While standard size separation chromatography can be used to 
purify a single size class of particle, it is preferable to minimize the size distribution of the initial 
reconstitution mixture of target protein, MSP and phospholipid. The 8-10 nm particle is 

25 nominally composed of two MSP, Apo-AI or Apo-AI derivative proteins. Hence, we constructed 
a membrane scaffold protein where two of the truncated Apo-AI proteins (termed MSP1) are 
genetically liked to form a scaffold protein composed of a single polypeptide chain. This is 
schematically illustrated in Fig. 5G. 
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GPCRs which can be solubilized in nanoscale phospholipid bilay ers can include the Class 
A (Rhodopsin-like ) GPCRs, which bind amines, peptides, hormone proteins, rhodopsin, 
olfactory prostanoid, nucleotide-like compounds, cannabinoids, platelet activating factor, 
gonadotropin-releasing hormone, thyrotropin-releasing hormone and secretagogue, melatonin and 
lysosphingolipidandLPA. GPCRs with amine ligands include, without limitation, acetylcholine 
or muscarinic, adrenoceptors, dopamine, histamine, serotonin or octopamine receptors); peptide 
ligands include but are not limited to angiotensin, bombesin, bradykinin, anaphylatoxin, 
Fmet-leu-phe, interleukin-8, chemokine, cholecystokinin, endothelin, melanocortin, neuropeptide 
Y, neurotensin, opioid, somatostatin, tachykinin, thrombin vasopressin-like, galanin, proteinase 
activated, orexin and neuropeptide FF, adrenomedullin (G10D), GPR37/endothelin B-like, 
chemokine receptor-like and neuromedin U. 

Ligands of other specific GPCRs include hormone protein, rhodopsin, olfactory, 
prostanoid, nucleotide-like (adenosine, purinoceptors), cannabinoid, platelet activating factor, 
gonadotropin-releasing hormone, thyrotropin-releasing hormone & secretagogue, melatonin and 
lysosphingolipid & LP A, among others. Class B secretin-like GPCRs include, without 
limitation, those which bind calcitonin, corticotropin releasing factor, gastric inhibitory peptide, 
glucagon, growth hormone-releasing hormone, parathyroid hormone, PACAP, secretin, 
vasoactive intestinal polypeptide, diuretic hormone, EMR1 and latrophilin. Class C metabotropic 
glutamate receptors include those which bind metabotropic glutamate, extracellular 
calcium-sensing or GABA-B, among others. "Orphan" receptors whose ligands are not yet 
known are also potential targets of assays of the present invention. 

In the assays of the present invention which demonstrate binding of a particular ligand 
or which are used to identify inhibitors or competitors of ligand binding to an MSP-supported 
GPCR, a variety of labels can be incorporated within the ligand molecule (such as radioactive 
isotope, e.g., 3 H, 14 C, 35 S, 32 P) or detectable compounds can be attached to the ligand molecule 
provided that binding to the cognate receptor is not significantly reduced due to the label. Labels 
can include, without limitation, 125 1, 131 I, fluorescent compounds, luminescent compounds, etc). 



24 



# 

WO 02/40501 



PCTAJS01/43451 



The necessary properties of the linker sequence between fused MSPs are flexibility and 
solubility so that the fused proteins assemble into particles in a manner similar to two separate 
MSP. Linker sequences consisting of repeats of Gly-Gly-Gly-Ser/Thr- (SEQ ID NO:46) have 
these properties. It is also desirable to minimize the length of the linker. We constructed a 

5 fusion with the minimal linker -GT-, which corresponds to the consensus DNA restriction site 
for Kpn I, as described herein below. The Kpn I site provides an easy way of inserting any 
desired linker sequence by restriction with Kpn I and insertion of double-stranded synthetic DNA 
encoding any desired linker (Robinson et al. 1 998). We have also made a fusion construct with 
the linker sequence -GTGGGSGGGT- (SEQ ID NO:15). The MSP2 with the minimal linker, 

10 however, assembles into particles very similar to particles containing two MSP1 proteins. 

The complete amino acid and nucleic acid sequences for the MSP2 scaffold protein is 
shown in Tables 7 and 8; see also SEQ ID NO: 1 6 and SEQ ID NO : 1 7. The MSP2 fusion protein 
was expressed in E. coli and purified to homogeneity using basically the same procedure as 
described for the single MSPs. The MSP2 protein serves as an effective scaffold protein, 
1 5 self-assembling with phospholipid upon removal of solubilizing detergent. Addressing the point 
of sample heterogeneity, Fig. 13 shows densitometry traces of native gradient polyacrylamide 
gels with the individual peaks labeled as to mean particle diameter in Angstroms. Clearly 
evident, particularly at a lipid/dimer ratio of 200 corresponding to nominally 10 nm particles, is 
the much greater monodispersivity afforded by the MSP2 protein. 

20 Importantly, the overall stability of the disks, as monitored by chemically induced 

unfolding and exposure of tryptophan residues to solvent, is not altered by the fusion of the 
monomeric membrane scaffold proteins, as shown in Fig. 14. 

An important technique used in the characterization of disk structures and associated 
proteins is scanning probe microscopy (SPM). SPM is an umbrella term for any microscope that 
25 utilizes the scanning principles first pioneered in the scanning tunneling microscope (STM), but 
these microscopes can vary so greatly they are best discussed in terms of their guiding central 
principle. The technology has been used in the analysis of biological membranes and their 
associated proteins, bilayer structures and incorporated membrane proteins surfaces. SPM 
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combines independent mobility in all three spatial directions (scanning) with a detection system 
capable of detecting some characteristic of the surface (probing). The various surface 
characteristics that can be probed (conductivity, surface forces, compressibility, capacitance, 
magnetic, fluorescence emission) demonstrate the wealth of information that can be obtained. 

5 The excellent z-axis sensitivity of atomic force microscopy makes the presence of proteins 
binding to an rHDL monolayer easily detectable, as we have shown for well characterized 
integral and anchored membrane proteins immobilized in MSP-supported nanodiscs (Bayburt 
et al., 1 998). Other examples of the usefulness of the precise height measurements possible with 
AFM are our direct quantitation of rHDL particle height (Carlson et al., 1997), and membrane 

10 protein height measurements obtained by modulating the force of the AFM probe on various 
nanodisc assemblies (Bayburt et al., 2000). The surface association of disks formed from MSPs 
allows the utilization of the human apo A-I protein and its improved variants to directly 
investigate the biophysical properties of single membrane proteins incorporated into phospholipid 
bilayers on surfaces by SPM. The ability to attach disks to atomically flat conductive surfaces 

15 (such as gold) is necessary for scanning tunneling microscopy (STM). In theory, tunneling 
through a redox-active system can be used to probe the functional state of an enzyme (Friis et al., 
1999; Mukhopadhyay et al., 2000). These two techniques provide complementary data and can 
be used in concert to create as complete a picture as possible of events occurring at the 
bilayer/solution interface. The ability to place disks on a gold surface also allows the use of 

20 surface plasmon resonance (SPR). Insertion of membrane proteins into such artificial lipid 
bilayers, or their interaction with surface-associated proteins can be detected and quantified by 
SPR. 

Measurements of disk stabilities and determination of size dispersion among classes are 
necessary to evaluate the constructs that are being created. Gel filtration and native gel 
25 electrophoresis are used to separate and quantitate the different size classes. Spectroscopy is used 
to quantitate secondary structure (CD) and lipid association (fluorescence) characteristics of the 
engineered MSPs, including stabilities based on thermal and chemical denaturation. 
Compositions and stoichiometrics of components in disks are quantitated by traditional methods 
(Jonas, 1986). 
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AFM is used to provide molecular resolution data on the structural organizations of the 
lipid and protein components of the systems produced. This technique can be used in air, 
vacuum, and under aqueous and non-aqueous fluids. The latter capability has made it the most 
important scanning probe technique in the biological sciences. The AFM is a very versatile 
instrument as it is capable of acquiring images and other forms of force data in several different 
modes (Sarid, 1994) such as contact, tapping, phase, and lateral force. All of these scanning 
modes are available on the Digital Instruments Multimode Scanning Probe Microscope (Digital 
Instruments, Plainview, N. Y.) and they have been successfully used to image rHDL and proteins 
associated with rHDL layers under biological buffers. This instrument can also be used in STM 
and electrochemical modes to study characteristics of gold-associated constructs and incorporated 
redox proteins. 

As used herein, membrane scaffold proteins are proteins or polypeptides which self 
assemble phospholipids and phospholipid mixtures into nanometer size membrane bilayers. A 
subset of these structures are discoidal in shape and are referred to as nanodiscs or nanodisks. 
Hydrophobic proteins, e.g., membrane proteins, or membrane fragments can associate with these 
particles such that the hydrophobic proteins or membrane fragments are effectively solubilized 
in a stable structure which maintains the functionality of the protein with respect to enzymatic 
activity or ligand binding. These particles are stable in solution or they can be fixed to a surface, 
advantageously in a uniform orientation with respect to the surface. As used herein, a 
nanoparticle comprising MSPs (with or without another hydrophobic or a partially hydrophobic 
protein) can be from about 5 to about 500 nm, desirably about 5 to about 1 00 run, or about 5 to 
about 50 nm in diameter. Nanoparticles (disks) of about 5 to about 15 nm in diameter are 
especially useful. 

We have shown that both MSP 1 and MSP2 assemble with bacteriorhodopsin. From the 
initial reconstitution mixture, two bacteriorhodopsin-containing species are observed when 
particles are formed with MSP1 (Fig. 11) or MSP2 (Fig. 12) in the absence of added 
phospholipid. We found that the MSP is absolutely required for the solubilization of 
bacteriorhodopsin to form these species because omission of an MSP from the formation mixture 
results in large non-specific bacteriorhodopsin aggregates that elute in the void volume of the gel 
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filtration column. The small peak at 15 minutes in Fig. 1 1 represents BR aggregates. In these 
experiments, it appears that the majority of bacteriorhodopsin appears solubilized in the presence 
of MSPs. The two sizes of particles observed are completely consistent with the putative "hinge 
domain" adopting alternate conformations in the structures. From previous work, this flexible 

5 hinge region is believed to consist of helices corresponding to helices 5 and 6 of human Apo-AI 
and thus by inference to MSP1. Thus, in the 9.8 nanometer diameter 
bacteriorhodopsin-contaming particles, these flexible parts of the protein structure appear to be 
associated with the hydrophobic core of the structure while in 7.6 nanometer diameter particles, 
this helical region is dissociated from the hydrophobic core, thus forming a smaller diameter 

10 particle. 

Further modifications of the parent Apo-AI protein can generate more effective and stable 
membrane scaffold proteins. For instance, to increase the homogeneity of the BR/MSP structures 
and to address the issue of the flexible "hinge" region of the protein structure discussed above, 
we have deleted the hinge domain region to produce two new membrane scaffold proteins. In 
15 the first case, putative helical regions 4 and 5 were deleted from the MSP1 Mstidine-tagged 
construct to produce a construct called MSP 1 D5-6. In a second experiment, the putative helices 
5 and 6 were deleted to produce a material termed MSP1D4-5. We have overexpressed these 
proteins in E. coli, which are expressed at high levels upon induction of expression with 
isopropyl-thio-b-D-galactopyranoside in /ac-regulated constructs. 

20 An alternative way of avoiding the formation of multiple particle size classes is to 

engineer MSP constructs so that the hinge domain helices are replaced by helices having higher 
affinities for the hydrophobic core of the particles. In this case, the higher affinity interaction 
disfavors the formation of the smaller species wherein the hinge domain is dissociated. In this 
experimental line, we have chosen to replace the hinge region (helices 5 and 6) with the protein 

25 sequence corresponding to the native sequence corresponding to helices 1 and 2. In another 
manifestation, we have chosen a DNA construct encoding a membrane scaffold protein wherein 
the protein sequence corresponding to the putative helical regions 3 and 4 are used to replace the 
hinge region with a goal of yielding a single size particle upon assembly with bacteriorhodopsin 
and lipid. 
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The so-called "half-repeat" units present in the parent human Apo-AI protein also may 
give rise to conformational heterogeneity in MSP assemblies. For instance, in the picket fence 
model these helices adopt a conformation parallel to the bilayer plane and do not play a major 
role in interactions with the hydrophobic core of the particle as the other regions of the protein 
sequence are envisioned to contribute. In the "belt model" these short helical repeats could give 
rise to segmented mobility allowing the MSP to adopt different conformations. In other words, 
a MSP in which the number of types of structural elements is minimized is most likely desirable 
embodiment of the membrane scaffold protein concept. Thus, in order to further optimize the 
structure of the membrane scaffold protein with respect to their ability to solubilize integral 
membrane protein targets, we can engineer derivative sequences that will delete both half-repeat 
units to produce a simplified MSP structure. 

Monoclonal or polyclonal antibodies, preferably monoclonal, specifically reacting with 
an MSP of the present invention can be made by methods known in the art. See, e.g., Harlow and 
Lane (1988) Antibodies: A Laboratory Manual, Cold Spring Harbor Laboratories; Goding (1986) 
Monoclonal Antibodies: Principles and Practice, 2d ed., Academic Press, New York; and 
Ausubel et al. (1993) Current Protocols in Molecular Biology, Wiley Interscience, New York, 
NY. 

Standard techniques for cloning, DNA isolation, amplification and purification, for 
enzymatic reactions involving DNA ligase, DNA polymerase, restriction endonucleases and the 
like, and various separation techniques are those known and commonly employed by those 
skilled in the art. A number of standard techniques are described in Sambrook et al. (1989) 
Molecular Cloning, Second Edition, Cold Spring Harbor Laboratory, Plainview, New York; 
Maniatis et al. (1 982) Molecular Cloning, Cold Spring Harbor Laboratory, Plainview, New York; 
Wu (ed.) (1 993) Meth. Enzymol. 21 8, Part I; Wu (ed.) (1 979) Meth Enzymol 68; Wu et al. (eds.) 
(1983) Meth Enzymol 100 and 101; Grossman and Moldave (eds.) Meth Enzymol 65; Miller 
(ed.) (1972) Experiments in Molecular Genetics, Cold Spring Harbor Laboratory, Cold Spring 
Harbor, New York; Old and Primrose (1981) Principles of Gene Manipulation, University of 
California Press, Berkeley; Schleif and Wensink (1 982) Practical Methods in Molecular Biology; 
Glover (ed.) (1985) DNA Cloning Vol. I and II, IRL Press, Oxford, UK; Hames and Higgins 
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(eds.) (1 985) Nucleic Acid Hybridization, IRL Press, Oxford, UK; Setlow and Hollaender (1979) 
Genetic Engineering: Principles and Methods, Vols. 1 -4, Plenum Press, New York; and Ausubel 
et al. (1992) Current Protocols in Molecular Biology, Greene/Wiley, New York, NY. 
Abbreviations and nomenclature, where employed, are deemed standard in the field and 
commonly used in professional journals such as those cited herein. 

All references cited in the present application are incorporated by reference herein to the 
extent that there is no inconsistency with the present disclosure. 

The description provided herein is not intended to limit the scope of the invention as 
claimed herein. Any variations in the exemplified articles and methods which occur to the skilled 
artisan are intended to fall within the scope of the present invention. 

EXAMPLES 

Example 1 . Construction of Recombinant DNA Molecules for Expression of MSPs 

The human proapoAI coding sequence as given below was inserted between Nco I and 
Hind HI sites (underlined) in pET-28 (Novagen, Madison, WI). Start and stop codons are in bold 
type. The restriction endonuclease recognition sites used in cloning are underlined. 
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Table 1 . ProApoAI coding sequence (SEQ ID NO : 1 ) Restriction sites used in cloning are 

underlined, and the translation start and stop signals are shown in bold. 

CCAJ^GCCCATTTCTGGCAGCAAGATGAACCCCCCCAGAGCCCCTGGGATCGAGT 

GAAGGACCTGGCCACTGTGTACGTGGATGTGCTCAAAGACAGCGGCAGAGACTAT 

GTGTCCCAGTTTGAAGGCTCCGCCTTGGGAAAACAGCTAAACCTAAAGCTCCTTGA 

CAACTGGGACAGCGTGACCTCCACCTTCAGCAAGCTGCGCGAACAGCTCGGCCCT 

GTGACCCAGGAGTTCTGGGATAACCTGGAAAAGGAGACAGAGGGCCTGAGGCAA 

GAGATGAGCAAGGATCTGGAGGAGGTGAAGGCCAAGGTGCAGCCCTACCTGGAC 

GACTTCCAGAAGAAGTGGCAGGAGGAGATGGAGCTCTACCGCCAGAAGGTGGAG 

CCGCTGCGCGCAGAGCTCCAAGAGGGCGCGCGCCAGAAGCTGCACGAGCTGCAA 

GAGAAGCTGAGCCCACTGGGCGAGGAGATGCGCGACCGCGCGCGCGCCCATGTG 

GACGCGCTGCGCACGCATCTGGCCCCCTACAGCGACGAGCTGCGCCAGCGCTTG 

GCCGCGCGCCTTGAGGCTCTCAAGGAGAACGGCGGCGCCAGACTGGCCGAGTAC 

CACGCCAAGGCCACCGAGCATCTGAGCACGCTCAGCGAGAAGGCCAAGCCCGCG 

CTCGAGGACCTCCGCCAAGGCCTGCTGCCCGTGCTGGAGAGCTTCAAGGTCAGCT 

TCCTGAGCGCTCTGGAGGAGTACACTAAGAAGCTCAACACCCAGTAATAAGCTT-3' 



Table 2. Proapoal amino acid sequence (SEQ ED NO:2) 

MAHFWQQDEPPQSPWDRVKDLATVYVDVLKDSGRDYVSQFEGSALGKQLNLKLLD 
NWDSVTSTFSKLREQLGPVTQEFWDNLEKETEGLRQEMSKDLEEVKAKVQPYLDDF 
QKKWQEEMELYRQKVEPLRAELQEGARQKLHELQEKLSPLGEEMRDRARAHVDAL 
R1HLAPYSDELRQRLAARLEALKENGGARLAEYHAKATEHLSTLSEKAKPALEDLRQ 

GLLPVLESFKVSFLSALEEYTKKLNTQ 

The construction of the MSP1 coding sequence was accomplished as follows. Primers 
were designed to produce DNA encoding MSP1, the truncated protein lacking the N-terminal 
domain of proApoAI, by polymerase chain reaction (PCR) mutagenesis (Higuchi et al., 1988). 
Primer 1 (SEQ ID NO:3) 
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(5'-TATACCATGGGCCATCATCATCATCATCATATAGAAGGAAGACTAAAGCTCCT 
TGACAACT-3') introduces an N-terminal 6-histidine tag for purification and manipulation of 
MSP1, and a factor Xa cleavage site for removal of the histidine tag. Factor Xa cleaves after R 
in the protein sequence IEGR. Primer 2 (SEQ ID NO:4) 
5 (5'-GCAAGCTTATTACTGGGTGTTGAGCTTCTT-3 ') was used as a reverse primer. 



Table 3. Histidine-tagged MSP1 coding sequence (SEQ EDNO:5). Restriction sites used 
in cloning are underlined, and the translation start and stop signals are shown in 
bold. 

TAT ACCATGG GCCATCATCATCATCATCATATAGAAGGAAGACTAAAGCrCCTTGAC 
10 AACTGGGACAGCGTGACCTCCACCITCAGCAAGCTGCGCGAACAGCTCGGCCCTGT 
GACCCAGGAGTTCTGGGATAACCTGGAAAAGGAGACAGAGGGCCTGAGGCAGGA 
GATGAGCAAGGATCTGGAGGAGGTGAAGGCCAAGGTGCAGCCCTACCTGGACGAC 
TTCCAGAAGAAGTGGCAGGAGGAGATGGAGCTCTACCGCCAGAAGGTGGAGCCG 
CTGCGCGCAGAGCTCCAAGAGGGCGCGCGCCAGAAGCTGCACGAGCTGCAAGAG 
1 5 AAGTTGAGCCCACTGGGCGAGGAGATGCGCGACCGCGCGCGCGCCCATGTGGAC 
GCGCTGCGCACGCATCTGGCCCCCTACAGCGACGAGCTGCGCCAGCGCTTGGCC 
GCGCGCCTTGAGGCTCTCAAGGAGAACGGCGGCGCCAGACTGGCCGAGTACCAC 
GCCAAGGCCACCGAGCATCTGAGCACGCTCAGCGAGAAGGCCAAACCCGCGCTC 
GAGGACCTCCGCCAAGGCCTGCTGCCCGTGCTGGAGAGCTTCAAGGTCAGCTTCC 
20 TGAGCGCTCTCGAGGAGTACACTAAGAAGCTCAACACCCAGTAATAAGCTTGC 
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Table 4. Histidine-tagged MSP1 amino acid sequence (SEQ ID N0:6) 
MGHHHHHHffiGRLKLLDNWDSVTSTFSKLREQLGPVTQEFWDNLEKETEGLRQEMS 
KDLEEVKAKVQPYLDDFQKKWQEEMELYRQKVEPLRAELQEGARQKLHELQEKLSP 
LGEEMRDRARAHVDALRTHLAPYSDELRQRLAARLEALKENGGARLAEYHAKATE 
5 HLSTLSEKAKPALEDLRQGLLPVLESFKVSFLSALEEYTBGKLNTQ 

For production of MSP1 without aN-terminal histidine tag, primer 1 was replaced with 
primer la: 5'-TACCATGGCAAAGCTCCTTGACAACTG-3' (SEQ ID NO:7) to produce the 
sequence provided in SEQ ID NO:8. 



Table 5. Non-Histidine-tagged MSP1 DNA sequence (SEQ ID NO: 8). Restriction sites used 
10 in cloning are underlined, and the translation start and stop signals are shown in bold. 

T ACCATGGC AAAGCTCCTTGACAACTGGGACAGCGTGACCTCCACCTTCAGCAAGC 
TGCGCGAACAGCTCGGCCCTGTGACCCAGGAGTTCTGGGATAACCTGGAAAAGGA 
GACAGAGGGCCTGAGGCAGGAGATGAGCAAGGATCTGGAGGAGGTGAAGGCCA 
AGGTGCAGCCCTACCTGGACGACTTCCAGAAGAAGTGGCAGGAGGAGATGGAGC 
1 5 TCTACCGCCAGAAGGTGGAGCCGCTGCGCGCAGAGCTCCAAGAGGGCGCGCGCC 
AGAAGCTGCACGAGCTGCAAGAGAAGTTGAGCCCACTGGGCGAGGAGATGCGCG 
ACCGCGCGCGCGCCCATGTGGACGCGCTGCGCACGCATCTGGCCCCCTACAGCG 
ACGAGCTGCGCCAGCGCTTGGCCGCGCGCCTTGAGGCTCTCAAGGAGAACGGCG 
GCGCCAGACTGGCCGAGTACCACGCCAAGGCCACCGAGCATCTGAGCACGCTCA 
20 GCGAGAAGGCCAAACCCGCGCTCGAGGACCTCCGCCAAGGC 

CTGCTGCCCGTGCTGGAGAGCTTCAAGGTCAGCTTCCTGAGCGCTCTCGAGGAGT 
ACACTAAGAAGCTCAACACCCAGTAATAAGCTTGC 
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Table 6. Non-Histidine-tagged MSP1 amino acid sequence (SEQ ID NO:9). 
MAKLLDNWDSVTSTFSKLREQLGPVTQEFWDNLEKETEGLRQEMSKDLEEVKAKV 
QPYLDDFQKKWQEEMELYRQKVEPLRAELQEGARQKLHELQEKLSPLGEEMRDRA 
RAHVDALRTHLAPYSDELRQRLAARLEALKENGGARLAEYHAKATEHLSTLSEKAK 
5 PALEDLRQGLLPVLESFKVSFLSALEEYTKKLNTQ 

The production of an MSP with tandem repeats (MSP2) was carried at as described 
below. The following primers were used to generate MSP2 (see Figs. 6A-6B): 
Primer 3 (SEQ ID NO:10): S'-TACCATGGCAAAGCTCCTTGACAACTG-S' 

primer 3a (SEQ ID NO:l 1): 
10 5'-TATACCATGGGCCATCATCATCATCATCATATAGAAGGAAGACTAAAGCTCCT 
TGACAACT-3' 

Primer 4 (SEQ ED NO: 12): 

5'-TAAGAAGCTCAACACCCAGGGTACCGGTGGAGGTAGTGGAGGTGGTACCCTA-3' 

Primer 5 (SEQ ID NO: 13): 
15 S'-CAGGGTACCGGTGGAGGTAGTGGAGGTGGTACCCTAAAGCTCCTTGACAA-S' 

Primer 6 (SEQ ID NO: 14): 5'-GCAAGCTTATTACTGGGTGTTGAGCTTCTT-3' 

In a first PCR, primer 2 (or primer 2a for N-terniinal histidine tag) and primer 4 were used to add 
a linker (encoding the amino acid sequence GTGGGSGGGT; SEQ ID NO:15) to the 3' end of 
the MSP gene to produce MSP-A. In a second PCR, the linker was added to the 5' end of the 
20 MSP gene to produce MSP-B. Treatment of MSP-A and MSP-B with Kpnl and subsequent 
ligation produced the following constructs, one with and one without the linker. The Kpn I site 
provides an easy way to inserting any desired linker sequence by restriction with Kpn I and 
religation with double-stranded synthetic DNA encoding desired linker. See Figs. 7A-7B. 
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Table 7. MSP2 (with histidine tag, without long linker) DNA sequence (SEQ ID NO: 16). 
The translation start and stop codons are in bold type, and the restriction endonuclease 
recognition sites used in cloning are underlined. 

TAT ACCATGQG CCATCATCATCATCATCATATAGAAGGAAGACTAAAGCTCCTTGACA 

ACTGGGACAGCGTGACCTCC^CCTTCAGCAAGCTGCGCGAACAGCTCGGCCCTGTGA 

CCCAGGAGTTCTGGGATAACCTGGAAAAGGAGACAGAGGGCCTGAGGCAGGAGATG 

AGCAAGGATCTGGAGGAGGTGAAGGCCAAGGTGCAGCCCTACCTGGACGACTTCCA 

GAAGAAGTGGCAGGAGGAGATGGAGCTCTACCGCCAGAAGGTGGAGCCGCTGCGCG 

CAGAGCTCCAAGAGGGCGCGCGCCAGAAGCTGCACGAGCTGCAAGAGAAGCTGAG 

CCCACTGGGCGAGGAGATGCGCGACCGCGCGCGCGCCCATGTGGACGCGCTGCGCA 

CGCATCTGGCCCCCTACAGCGACGAGCTGCGCCAGCGCTTGGCCGCGCGCCTTGAGG 

CTCTCAAGGAGAACGGCGGCGCCAGACTGGCCGAGTACCACGCCAAGGCCACCGAG 

CATCTGAGCACGCTCAGCGAGAAGGCCAAGCCCGCGCTCGAGGACCTCCGCCAAGG 

CCTGCTGCCCGTGCTGGAGAGCTTCAAGGTCAGCTTCCTGAGCGCTCTCGAGGAGTA 

CACTAAGAAGCTCAACACCCAGGGTACCCTAAAGCTCCTTGACAACTGGGACAGCGTGACCTCC^ 

GCTGCGCGAACAGCTCGGCCCTGTGACCCAGGAGTTCTGGGATAACCTGGAAAAGGAGACAGAGGGCCTGAGGCA 

GGAGATGAGC7\AGGATCTGGAGGAGGTGAAGGCC7VAGGTGC^GCCCTACCTGGACGACTTCCAGAAGAAGTGGC^ 

GGAGGAGATGGAGCTCTACCGCC^GAAGGTGGAGCCGCTGCGeGCAGAGCTCO^AGAGGGCGCGCGCCAGAAGCT 

GCACGAGCTGCAAGAGAAGCTGAGCCCACTGGGCGAGGAGATGCGCGACCGCGCGCGCGCCCATGTGGACGCGCT 

GCGCACGCATCTGGCCCCCTACAGCGACGAGCTGCGCCAGCGCTTGGCCGCGCGCCTTGAGGCTCTC 

AAGGAGAACGGCGGCGCCAGACTGGCCGAGTACC&CGC 

GCCAAGCCCGCGCTCGAGGACCTCCGCCAAGGCCTGCTGCCCGTGCTGGAGAGCTTCAAGGTCAGCTTCCT^ 
GCTCTCGAGGAGTACACTAAGAAGCTCAACACCC^GTAATAAGCTTGC 
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Table 8. MSP2 (with histidine tag, without long linker) amino acid sequence (SEQ ID 
NO:17) 

MGHHHHHHIEGRLKLLDNWDSWSTFSKLR 
KKWQEEMELYRQKVEPLRAELQEGARQKLH^ 
5 LKENGGARIJ^YHAKATEHLSTLSEKAKPALEDLRQGLLPVLESFCT 
TSTFSKLREQL^PVTQEFWDNLEKETEGLRQEMSKD^ 

GARQKLHELQEKLS PLGEEMRDRARAHVDALRTHLM TL 
SEKAKPALEDLRQGLLPVLES FKVS FLSALEE YTKKLNTQ 



Table 9. MSP2 (with histidine tag, with long linker) DNA sequence (SEQ ID NO: 18). 
1 0 Translation start and stop codons are in bold type; restriction endonuclease sites used in cloning 
are underlined. 

T ACCATGG GCC^TC^TCAT<^TCATCaTATAGAAGGAfiGACTAAAGCTCCITGAC^ 
CTGGGAC^GCGTGACCTCC^CCTTCAGCAAGCTGCGCGAACAGCTCGGCCCTGTGAC 
CCAGGAGTTCTGGGATAACCTGGAAAAGGAGACAGAGGGCCTGAGGCAGGAGATGA 

1 5 GC^^GGATCTGGAGGAGGTGAAGGCCAAGGTGCAGCCCTACCTGGACGACTTCCAG 
AAGAAGTGGCAGGAGGAGATGGAGCTCTACCGCCAGAAGGTGGAGCCGCTGCGCGC 
AGAGCTCCAAGAGGGCGCGCGCCAGAAGCTGCACGAGCTGCAAGAGAAGCTGAGCC 
CACTGGGCGAGGAGATGCGCGACCGCGCGCGCGCCCATGTGGACGCGCTGCGCACG 
CATCTGGCCCCCTACAGCGACGAGCTGCGCCAGCGCTTGGCCGCGCGCCTTGAGGCT 

20 CTCAAGGAGAACGGCGGCGCCAGACTGGCCGAGTACCACGCCAAGGCCACCGAGCA 
TCTGAGCACGCTCAGCGAGAAGGCCAAGCCCGCGCTCGAGGACCTCCGCCAAGGCC 
TGCTGCCCGTGCTGGAGAGCTTCAAGGTCAGCTTCCTGAGCGCTCTCGAGGAGTACA 
CTAAGAAGCTCAACACCCAGGGTACCGGTGGAGGTAGTGGAGGTGGTACCCTAAAG 
CTCCTTGACAACTGGGACAGCGTGACCTCCACCTTCAGCAAGCTGCGCGAACAGCTC 

25 GGCCCTGTGACCCAGGAGTTCTGGGATAACCTGGAT^AAGGAGACAGAGGGCCTGAG 
GCAGGAGATGAGCAAGGATCTGGAGGAGGTGAAGGCCAAGGTGCAGCCCTACCTGG 

ACGACTTCCAGAAGAAGTGGCAGGAGGAGATGGAGCTCTACCGCCAGAAGGTGGAGCCGCTGCGCGCAGAGCTCC 
AAGAGGGCGCGCGCCAGAAGCTGCACGAGCTGCAAGAGAAGCTGAGCCCACTGGGCGAGGAGATGCGCGACCGCG 
CGCGCGCCCATGTGGACGCGCTGCGCACGCATCTGGCCCCCTACAGCGACGAGCTGCGCCAGCGCTTGGCCGCGC 
30 GCCTTGAGGCTCTCAAGGAGAACGGCGGCGCCAGACTGGCCGAGTACCACGCCAAGGCCACCGAGCATCTGAGCA 
CGCTCAGCGAGAAGGCCTU^GCCCGCGCTCGAGGACCTCCGCCAAGGCCTGCTGCCCGTGCTGGAGAGCTTCAAG^ 
TCAGCTTCCTGAGCGCTCTCGAGGAGTACACTAAGAAGCTCAACACCCAGTAATAAGCTTGC 
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Table 10. MSP2 (with histidine tag, with long linker, in bold type) amino acid sequence (SEQ 
IDNO:19). 

MGHHHHHHIEGRLKLLDNWDSVTSTFSKLREQLGPVTQEFWDNLEKETEGLRQEMS 
KDLEEVKAKVQPYLDDFQKKWQEEMELYRQKVEPLRAELQEGARQKLHELQEKLSP 
LGEEMRDRARAHVDALRTHLAPYSDELRQRLAAIO.EALKENGGARLAEYHAKATE 
HLSTLSEKAKPALEDLRQGLLPVLESFKVSFLSALEEYTKKLNTQGTGGGSGGGTLK 
LLDNWDSVTSTFSKLREQLGPVTQEFwTJNLEKETEGLRQEMSKDLEEVKAKVQPYL 
DDFQKKWQEEMELYRQKVEPLRAELQEGARQKLHELQEKLSPLGEEMRDRARAHV 
DALRTHLAPYSDELRQRLAARLEALKENGGARLAEYHAKATEHLSTLSEKAKPALED 
LRQGLLPVLESFKVSFLSALEEYTKKLNTQ 

To delete hinge regions, deletion of helices 4 and 5 was carried out by constructing the 
C^erminal portion of MSP1 using the following PCR primers and the Sac I and Hind HI 
fragment of the MSP1 coding sequence as template. 

Primer A (SEQ ID NO:20): 

5'-TGGAGCTCTACCGCCAGAAGGTGGAGCCCTACAGCGACGAGCT-37 

Primer B (SEQ ID NO:21): S'-GCAAGCTrATTACTGGGTGTTGAGCTTCTT-S'. 

This amplification product was digested with SacI and Hindm and ligated into pLitmus 28 for 
sequencing. The Sac I +Hindin treated histidine-tagged MSP1 construct in pET 28 vector was 
then ligated with the above fragment to produce MSP1D4-5. 
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Table 11. MSP1D4-5 (helices 4 and 5 deleted) DNA sequence (SEQ IDNO:22). 
Translations start and stop codons are in bold type; restriction endonuclease recognition sites are 
underlined. 

TAT ACCATGGG CCATCATCATCATCATCATATAGAAGGAAGACTAAAGCTCCTTG 
ACAACTGGGACAGCGTGACCTCCACCTTCAGCAAGCTGCGCGAACAGCTCGGCC 
CTGTGACCCAGGAGTTCTGGGATAACCTGGAAAAGGAGACAGAGGGCCTGAGGC 
AGGAGATGAGCAAGGATCTGGAGGAGGTGAAGGCCAAGGTGCAGCCCTACCTGG 
ACGACTTCCAGAAGAAGTGGCAGGAGGAGATGGAGCTctaccgccagaaggtggagcCCTA 
CAGCGACGAGCTGCGCCAGCGCTTGGCCGCGCGCCTTGAGGCTCTCAAGGAGAA 
CGGCGGCGCCAGACTGGCCGAGTACCACGCCAAGGCCACCGAGCATCTGAGCAC 
GCTCAGCGAGAAGGCCAAACCCGCGCTCGAGGACCTCCGCCAAGGCCTGCTGCC 
CGTGCTGGAGAGCTTCAAGGTCAGCTTCCTGAGCGCTCTCGAGGAGTACACTAAG 
AAGCTCAACACCCAGTAATAAGCTTGC 



Table 12. MSP1D4-5 (helices 4 and 5 deleted) amino acid sequence (SEQ ID NO:23). 
MGHHHHHHIEGPXKLLDNWDSVTSTFSKLPJEQLGPVTQEFWDNLEKETEGLRQEMS 
KDLEEVKAKVQPYLDDFQKKWQEEMELYRQKVEPYSDELRQPxLAARLEALKENGG 
APXAEYHAKATEHLSTLSEKAKPALEDLRQGLLPVLESFKVSFLSALEEYTKKLNTQ 

Deletion of helices 5 and 6 was performed in a similar manner, but two separate PCR 
steps using the following primers were employed in a first reaction (Reaction 1, Primer C: 
5'-CAGAATTCGCTAGCCGAGTACCACGCCAA-3', SEQ ID NO:24; and Primer D: 
5'-GCAAGCTTATTACTGGGTGTTGAGCTTCTT-3', SEQ ID NO:25) and a second reaction 
(Reaction 2, Primer E: 5'-ATACCATGGGCCATCATCATCATCATCATA-3', SEQ ID NO:26; 
and Primer F: 5'-CAGAATTCGCTAGCCTGGCGCTCAACTTCTCTT-3^ SEQ IDNO:27. 
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The PCR products encode the N- and C- tenninal portions of MSP both lacking helices 
5 and 6 and each contain a Nhel restriction site. After digestion of the PCR products with Nhe 
I, Ncol and Hindin, the fragments was ligated into Ncol+Hindm treated pET 28 to produce the 
DNA sequence of MSP1D5-6 lacking helices 5 and 6. See Figs. 9A-9B. 



5 Table 13. MSP1D5-6 (helices 5 and 6 deleted) DNA sequence (SEQ IDNO:28). 
Translation start and stop codons are shown in bold type, and restriction endonuclease 
recognition sites used in cloning are underlined. 

TATACCAlSeGCCATCATCATCATCATCATATAGAAGGAAGACTAAAGCTCCTTGA 
CAACTGGGACAGCGTGACCTCCACCTTCAGCAAGCTGCGCGAACAGCTCGGCCCTG 

10 TGACCCAGGAGTTCTGGGATAACCTGGAAAAGGAGACAGAGGGCCTGAGGCAGG 
AGATGAGCAAGGATCTGGAGGAGGTGAAGGCCAAGGTGCAGCCCTACCTGGACG 
ACTTCCAGAAGAAGTGGCAGGAGGAGATGGAGCTCTACCGCCAGAAGGTGGAGC 
CGCTGCGCGCAGAGCTCCAAGAGGGCGCGCGCCAGAAGCTGCACGAGCTGCAAG 
AGAAGTTGAGCGCCAGGCTAGCCGAGTACCACGCCAAGGCCACCGAGCATCTGA 

1 5 GCACGCTCAGCGAGAAGGCCAAACCCGCGCTCGAGGACCTCCGCCAAGGCCTGC 
TGCCCGTGCTGGAGAGCTTCAAGGTCAGCTTCCTGAGCGCTCTCGAGGAGTACAC 
TAAGAAGCTCAACACCCAGTAATAAGCTTGC 



Table 14. MSP1D5-6 (helices 5 and 6 deleted) amino acid sequence (SEQ ID NO:29). 
MGHHHHHfflEGPXKLLDNWDSVTSTFSKLREQLGPVTQEFWDNLEKETEGLRQEMS 
20 KDLEEVKAKVQPYLDDFQKKWQEEMELYRQKVEPLRAELQEGARQKLHELQEKLS 
ARLAEYHAKATEHLSTLSEKAJCPALEDLRQGLLPVLESFKVSFLSALEEYTKKLNTQ 
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Example 2. Construction of synthetic MSP gene. 

A synthetic gene for MSP1 is made using the following overlapping synthetic 
oligonucleotides which are filled in using PCR. The codon usage has been optimized for 
expression in E. coli, and restriction sites have been introduced for further genetic manipulations 
of the gene. 

Synthetic nucleotide tapsla (SEQ ID NO:30) 

TACCATGGGTCATCATCATCATCATCACATTGAGGGACGTCTGAAGCTGTT 
GGACAATTGGGACTCTGTTACGTCTA 

Synthetic nucleotide taps2a (SEQ ED NO:31) 

AGGAATTCTGGGACAACCTGGAAAAAGAAACCGAGGGACTGCGTCAGGA 
AATGTCCAAAGAT 

Synthetic nucleotide taps3a (SEQ ID NO:32) 

TATCTAGATGACTTTCAGAAAAAATGGCAGGAAGAGATGGAATTATATCG 
TCAA 

Synthetic nucleotide taps4a (SEQ ID NO:33) 

ATGAGCTCCAAGAGAAGCTCAGCCCATTAGGCGAAGAAATGCGCGATCGC 
GCCCGTGCACATGTTGATGCACT 

Synthetic nucleotide taps5a (SEQ ID NO:34) 

GTCTCGAGGCGCTGAAAGAAAACGGGGGTGCCCGCTTGGCTGAGTACCAC 
GCGAAAGCGACAGAA 

Synthetic nucleotide taps6a (SEQ ID NO:35) 

GAAGATCTACGCCAGGGCTTATTGCCTGTTCTTGAGAGCTTTAAAGTCAGT 
TTTCT 
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Synthetic nucleotide taps lb (SEQ ID NO:36) 

CAGAATTCCTGCGTCACGGGGCCCAOTTGTTCGCGAAGTTTACTGAAGGT 
AGACGTAACAG 

Synthetic nucleotide taps2b (SEQ ID NO:37) 

TCATCTAGATATGGCTGAACCTTGGCCTTCACCTCTTCTAAATCTTTGGAC 
ATTT 

Synthetic nucleotide taps3b (SEQ ID NO:38) 

TGGAGCTCATGGAGTTTTTGGCGTGCCCCCTCTTGCAGTTCCGCACGCAGC 
GGTTCCACCTTTTGACGATATAATTCCAT 

Synthetic nucleotide taps4b (SEQ ID NO:39) 

GCCTCGAGACGTGCGGCCAAACGCTGGCGAAGTTCATCCGAATACGGCGC 
CAAATGAGTCCGGAGTGCATCAACAT 

Synthetic nucleotide taps5b (SEQ ID NO:40) 

GTAGATCTTCCAGCGCCGGTTTCGCTTTTTCGCTCAAGGTGCTCAGGTGTT 
CTGTCGCTTT 

Synthetic nucleotide taps6b (SEQ ID NO:41) 

CCAAGCTTATTACTGGGTATTCAGCTTTTTAGTATATTCTTCCAGAGCTGA 
CAGAAAACTGACTTT 
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Table 15. Full synthetic gene sequence for MSP1 (SEQ ID NO:42). Restriction sites used 
in cloning are underlined, and the translation start and stop signals are shown in bold. 



A£CATGGGTCATCATCATCATCATCACATTGAGGGACGTCTGAAGCTGTTGGACAA 
TTGGGACTCTGTTACGTCTACCTTCAGTAAACTTCGCGAACAACTGGGCCCCGTGAC 
GCAGGAATTCTGGGACAACCTGGAAAAAGAAACCGAGGGACTGCGTCAGGAAATG 
TCCAAAGATTTAGAAGAGGTGAAGGCCAAGGTTCAGCCATATCTAGATGACTTTCA 
GAAAAAATGGCAGGAAGAGATGGAATTATATCGTCAAAAGGTGGAACCGCTGCG 
TGCGGAACTGCAAGAGGGGGCACGCCAAAAACTCCATGAGGTCCAAGAGAAGCT 
CAGCCCATTAGGCGAAGAAATGCGCGATCGCGCCCGTGCACATGTTGATGCACT 
CCGGACTCATTTGGCGCCGTATTCGGATGAACTTCGCCAGCGTTTGGCCGCACGT 
CTCGAGGCGCTGAAAGAAAACGGGGGTGCCCGCTTGGCTGAGTACCACGCGAAA 
GCGACAGAACACCTGAGCACCTTGAGCGAAAAAGCGAAACCGGCGCTGGAAGAT 
CrACGCCAGGGCTTATTGCCTGTTCTTGAGAGCTTTAAAGTCAGTTTTCTGTCAGC 
TCTGGAAGAATATACTAAAAAGCTGAATACCCAGTAATAAGCTTGG 



The following is the amino acid sequence of a MSPp polypeptide in which half repeats 
are deleted: 



Table 1 6. MSP with first half-repeat deleted (MSPldeltal) (SEQ ID NO:43) 
MGHHHHHfflEGRLKLLDNWDSVTSTFSKLR 

PYLDDFQKKWQEEMELYRQKVEPLRAELQEGARQKLHELQEKLSPLGEEMRDRAR 
AHVDALRTHLAPYSDELRQRLAARLEALKENGGARLAEYHAKATEHLSTLSEKA^ 
ALEDLRQGLLPVLESFKVSFLSALEEYTKKLNTQ 
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Table 17. MSP with second half-repeat deleted (MSPldelta2) (SEQ ID NO:44) 

MGHHHHHHIEGRLKLLDNWDSVTSTFSKLREQLGPVTQEFWDNLEK^ 
IOXJEEVKAKVQPYLDOT 

PLGEEMRDRARAHVDALRTHLAPYSDELRQRLAARLEA^ 
EHLSTLSEKAKPVLESFKVSFLSALEEYTKKLNTQ 

Table 18. MSP tandem repeat with first half-repeats deleted (MSP2deltal) (SEQ ID NO:45) 
MGHHHHHfflEGRLKLLDNW^ 

PYLDDFQKKWQEEMELYRQKVEPLRAELQEGARQKLHELQEKLSPLGEEMR^ 

AHVDALRTHLAPYSDELRQRLAARLEALKENGGARLA 

ALEDLRQGLLPVLESFKVSFLSALEEYTKKLNTQGTLKLLDNWDSVTST^ 

GPVTQEFWDNLEKETEGLRQEMSPYLDDFQKKWQEEMELYRQKVEPLRAELQEGA 

RQKLHELQEKLSPLGEEMRDRARAHVDALRTHLAPYSDELRQRLAARL 

ARLAEYHAKATEHLSTLSEKAKPALEDLRQGLL 

Other constructs that can be readily produced include permutations of the above, i.e. 
MSP1 or MSP2 with any combination of the following: hinge deletion, hinge replacement, 
half-repeat deletion, histidine tag, different linkers for MSP2. 

Example 3. Expression of Recombinant MSPs 

To express MSP proteins, the nucleic acid constructs were inserted between the Ncol and 
Hindlll sites in the pET28 expression vector and transformed into E. coli BL21(DE3). 
Transformants were grown on LB plates using kanamycin for selection. Colonies were used to 
inoculate 5 ml starter cultures grown in LB broth containing 30 (ag/ml kanamycin. For 
overexpression, cultures were inoculated by adding 1 volume overnight culture to 100 volumes 
LB broth containing 30 ng/ml kanamycin and grown in shaker flasks at 37 °C. When the optical 
density at 600 nm reached 0.6-0.8, isopropyl b-D-thiogalactopyranoside (IPTG) was added to a 
concentration of 1 mM to induce expression and cells were grown 3-4 hours longer before 
harvesting by centrifugation. Cell pellets were flash frozen and stored at -80 °C. 
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Example 4. Purification of Recombinant MSPs 

Purification of histidine-tagged MSPs was carried out as follows. A frozen cell pellet 
from 1 liter of expression culture was resuspended in 25 milliliters of 20 mM Tris HC1 pH 7.5 
containing 1 mM phenylmethylsulfonyl fluoride. Triton X-100 
(t-octylphenoxypolyethoxyethanol) was added from a 1 0% (w/v) stock in distilled H20 to a final 
concentration of 1 %. The resuspended cells were sonicated on ice at 50% duty cycle at a power 
setting of 5 for four cycles of 1 minute on, 5 minutes off with a Branson probe sonifier. The 
resulting lysate was centrifuged for 30 minutes at 30,000 rpm in a Beckman Ti 45 rotor in a 
ultracentrifuge. The resulting supernatant was filtered through a 0.22 mm nylon syringe filter. 
The salt concentration was adjusted to 0.5 M from a 4 M NaCl stock in water and applied to a 
5 ml Hi-Trap nickel loaded column (Pharmacia, Piscataway, NJ). 

For 6H-MSP1, the column is washed with 20 ml buffer (10 mM Tris pH 8, 0.5 MNaCl) 
containing 1% Triton X-l 00, followed by 20 ml buffer + 50 mM sodium cholate, and then 20 ml 
buffer and 20 ml 100 mM imidazole in buffer. The His-tagged polypeptide is eluted with 1 5 ml 
0.5 M imidazole in buffer. 

For 6H-MSP2, the column is washed with 20 ml buffer (10 mM Tris pH 8, 0.5 M NaCl) 
containing 1% Triton X-100; 20 ml buffer + 50 mM cholate; 20 ml buffer; 20 ml 35 mM 
imidazole in buffer. The His-tagged polypeptide is then eluted with 15 ml 0.5 M imidazole in 
buffer, and the purified protein is dialyzed against 1 0 mM Tris pH 8, 0. 1 5 M NaCl using a 1 0,000 
MW cutoff cellulose dialysis membrane. 

Example 6. Production of MSP-containing Nanoscale Particles 

To reconstitute MSP proteins of the present invention with lipid, purified MSP was 
concentrated in a pressurized ultrafiltration device (Amicon) using a 10,000 MW cutoff filter to 
-2-6 mg protein/ml. Concentration of protein was determined by bicinchonic acid assay (Pierce 
Chemical, Rockford, IL) or measurement of A280 using theoretical absorption coefficient. 
Phospholipid (dipalmitoyl phosphatidylcholine in this case, however different 
phosphatidylcholines and mixtures of phosphatidylcholine and other lipids can be used) in 
chloroform stock solution was dried under a stream of nitrogen and placed in vacuo overnight. 
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Phosphate analysis was performed to determine the concentration of chloroform stock solutions. 
The dried lipid film was resuspended in buffer lOmMTrisHCipH 8.0 orpH7.5 containing 0.15 
M NaCl and 50 mM sodium cholate to give a final lipid concentration of 25 mM. The 
suspension was vortexed and heated to 50 °C to obtain a clear solution. Phospholipid solution 
was added to solution of MSP (2-6 mg/rol protein) to give molar ratios for MSP1 :lipid of 2:200 
and for MSP2 of 1 :200. The mixture was incubated overnight at 37 ° C and then dialyzed against 
1000 volumes of buffer without cholate with 4 changes of buffer over 2-3 days. 

Example 7. Tethered Membrane Protein Incorporation 

Tissue Factor (TF) isarepresentativetetheredmembrane protein. Inorderto demonstrate 
the value of MSP technology for a tethered membrane protein, recombinant human TF was 
incorporated into MSP-supported nanodiscs. The recombinant protein consists of an 
extracellular domain, the transmembrane anchor and a truncated cytosolic domain. The 
truncation increases the homogeneity of the protein by removing the C-terminal portions of the 
protein which are subject to proteolysis by bacterial enzymes. This modification does not affect 
TF activity. Additional modifications to the protein include an N-terminal trafficking peptide 
and an HPC4 epitope tag. The trafficking peptide directs the expressed protein to the 
intermembrane space of the recombinant E. coli host cell, in which space the peptide sequence 
is cleaved. The HPC4 epitope allows for affinity purification with Ca 2+ dependent antibody 
(Rezaie et al., 1992) and does not affect TF activity. 

A 25 mM lipid mixture containing 80% phosphatidyl choline and 20% phosphatidyl 
serine was solubilized with 50 mM cholate in 10 mM Tris CI, 150 mM NaCl at pH 8.0. TF, 
MSP 1 and lipid (in a ratio of 1 : 10: 1 000) were combined and incubated overnight at 37°C. The 
sample was then dialyzed at 37°C (10,000 dalton molecular weight cutoff membrane) against 
buffer containing lOmMTrisCl, 150mMNaClatpH8.0(lackingcholate)for2hours. Dialysis 
was then continued at 4 °C for an additional 6 hours with buffer changes every 2 hours. The 
approximately 1 ml sample was then concentrated to <250 \A using a YM-10 centrifuge 
concentrator and injected into a Pharmacia 10/30 Superdex 200 HR gel filtration column. 
Samples were eluted with buffer identical to that described above (no cholate) at 0.5 ml per 
minute. Fractions from chromatography were run on an 8-25% gradient SDS polyacrylamide gel 
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to determine apparent size and then checked for coagulation activity. The chromatogram 
showing elution of TF incorporated into an excess population of MSP 1 nanodiscs is shown in 
Fig. 16A-16B. 

The activity of TF in several disk fractions was determined by coagulation assays with 
human serum. Activity was monitored in fractions 25-28 as the inverse of coagulation time. 
Activity was highest in fraction 25 at 40 hr* 1 and decreased through fraction 28 at 30 hr" 1 . This 
is expected from the size chromatogram in that the leading edge of the nanodisc peak has a larger 
effective mass due to the incorporation of TF in the MSP-supported bilayer. This assay thus 
demonstrates that TF is incorporated into nanodiscs in an active conformation and that the 
membrane environment of the nanodisc closely mimics that of the native membrane system. 

Cytochrome b5 is a membrane anchored heme protein having a single membrane anchor 
domain that penetrates the membrane bilayer. Cytochrome b5 solubilized from its native 
membrane exists as large aggregates in the absence of detergent and appears as a smear rather 
than a discrete band on native polyacrylamide gel electrophoresis. Formation of nanodiscs 
through a self-assembly process wherein cytochrome b5 is added to the preparation of MSP and 
phospholipid results in the incorporation of cytochrome b5 into nanodisc structures. This is 
verified by the intense heme staining of the band corresponding to nanodiscs in the right panel 
of See Fig. 17B, lane 4. Cytochrome b5-containing nanodiscs separated by anion exchange 
chromatography are shown in lanes 5 and 6 of Fig. 17B. Two peaks elute from the anion 
exchange column near 310 raM NaCl and near 370 NaCl. Discs alone elute near 310 mM NaCl, 
and cytochrome b5 alone elutes between 450 and 700 mM NaCl. These data show that 
cytochrome b5 can be successfully solubilized using MSP technology and that disc complexes 
containing cytochrome b5 can be chromatographically separated and purified away from the 
undesired aggregated material. The optical absorption properties of the heme chromophore of 
the purified material demonstrate that the heme active site in a native conformation. 

Nanodiscs can also be formed by mixing 20 \xl of apo A-I (1 0 mg/ml), 6.6 |il cytochrome 
b5 (0.5 mM) and 50 ^leggphosphatidylcholine/sodiumcholate(11.2eggPC 5 6.2 mg/ml sodium 
cholate), incubating overnight at 4 ° C , followed by dialyzing to remove cholate. Purification was 
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accomplished using a Pharmacia MonoQ FPLC anion exchange column equilibrated in 25 mM 
Tris CI, pH 8.0. A linear gradient was run at 0.5 ml/min from 0-1 M NaCl in 20 min. 

Example 8. Embedded Membrane Protein Incorporation 

Cytochrome P450 2B4 from rabbit liver microsomes, cytochrome P450 3 A4 found in 
nature in human liver microsomes and cytochrome P450 6B1 from insect microsomes are 
representative of embedded membrane proteins. 

Cytochrome P450 2B4 was isolated from rabbit liver microsomes after induction with 
phenobarbital. Formation of 2B4 nanodiscs is as follows. Cytochrome P450 2B4 was 
reconstituted into disks by the detergent dialysis method. The buffer consisted of 10 mM Tris- 
HC1 pH 8.0, 0. 1 M NaCl, 1 0% (v/v) glycerol. The mixture of apo A-I, cholate and phospholipid 
(1 :220: 110 mole ratio) was incubated for 8 hours at 37°C followed by addition of P450 (1 : 0.5, 
apo A-LP450 mole ratio) and incubation overnight at room temperature. The mixture was 
dialyzed using a 10,000 MW cutoff slide-a-lyzer (Pierce Chemical Co., Rockford, IL) at room 
temperature for two hours followed by a change of buffer and continued dialysis at 4°C. It was 
found that 82% of the P450 content could be recovered under these conditions. After dialysis, 
the mixture was injected onto a Superdex 200 HR10/30 gel filtration column (Pharmacia, 
Uppsala, Sweden) equilibrated in reconstitution buffer at room temperature at a flow rate of 0.25 
ml/minute with collection of 0.5 ml fractions. Fractions were assayed using native 
polyacrylamide gradient gel electrophoresis on 8-25% gradient native gels and Coomassie 
staining using the Phastgel system (Pharmacia, Uppsala, Sweden). 

Human cytochrome P450 3A4, normally from liver microsomes, has also been cloned, 
expressed in £. coli and purified and incorporated into MSP-supported bilayer nanodiscs. 10 
nanomoles of MSP2, onemicromole of lipid, 5 nanomoles of cytochrome P450 3A4 protein and 
2 micromoles cholic acid were incubated together at 37°C for 2 hours. The incubated mixture 
was then dialyzed in a 1 OK Slide- A-lyzer Dialysis Cassette (Pierce Chemical Co., Rockford, IL). 
The dialysis was carried out with 10 mM potassium phosphate (pH 7.4) 1 50 mM NaCl buffer. 
The sample was dialyzed at 37 °C for 6 hours followed by a buffer change, and dialysis continued 
at 4°C with two buffer changes at 12 hour intervals. The samples were then fractionated on a 
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Superdex 200 HR 1 0/30 column (Pharmacia, Uppsala, SE) equilibrated in dialysis buffer at room 
temperature at a flow rate of 0.5 ml/min. 

The four graphs (Figs. 1 8-20) show that the retention times of the cytochrome P450 3 A4 
(observed by absorbance at 417 ran) and the nanodiscs (monitored at 280 nm where both MSP 
5 and the 3 A4 protein absorb) elute from the column at the same time, at approximately 24 min. 
This elution time also correlates closely to the calculated retention time of the disc protein 
complex. Further evidence that supports this is a native polyacrylamide gel electropherogram 
that directly measures the size of the eluted particles (Fig. 21). 

Cytochrome P450 6B1 is another model embedded membrane protein. This cytochrome 
10 has been isolated from Papilio polyxenes, the black swallowtail. These butterflies feed 
exclusively on plants producing furanocoumarins, plant metabolites that are phototoxic to most 
organisms. Cytochrome 6B1 catalyzes the detoxification of furanocoumarins. 

In order to show a new utility of the MSP methodology of the present invention, we 
demonstrated that isolated membranes containing their repertoire of membrane proteins could 

15 be sued as a source for incorporating membrane proteins into nanodiscs. An important 
illustrative embodiment is the use of the common insect cell (Sf9)-baculovirus expression system 
which is used widely as a heterologous expression system. Thus, we used an insect cell line co- 
infected such that a microsomal preparation containing overexpressed insect 6B1 and also 
overexpressed insect NADPH cytochrome P450 reductase. In these experiments we not only 

20 demonstrate that MSP nanodiscs can be used to incorporate another cytochrome P450 system into 
soluble monodisperse particles but also that the source of this P450 could be simply whole 
membranes containing this protein. 

A standard baculovirus expression system was used to obtain microsomal preparations 
with overexpressed insect cytochrome 6B1 and insect NADPH P450 reductase. Based on the 
25 lipid concentration contained in the microsomal preparations, MSP technology was used to 
assemble microsomal proteins into nanoparticle discs using a ratio of 110:1:220 
lipid:MSPl :cholate. The microsomal sample was detergent solubilized with cholate and mixed 
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withMSPl. The sample was incubated at 4°C for 2 hours. The detergent can be removed by * 
i dialysis or hydrophobic beads. In this experiment BIOBEADS (hydrophobic beads, BioRad, 
Hercules, CA) were added in excess (0.25 g per 1 ml disc mixture) and incubated for 2 hours at 
4°C for 2 hours to remove detergent. The sample was removed from the beads and the His 6 - 
5 tagged MSP was isolated by using a batch purification method with Ni 2+ resin. The MSP discs 
were then isolated by Superdex sizing column chromatography (Fig. 22). Incorporation of P450 
into the His 6 -tagged discs was followed by CO difference spectroscopy of nickel affinity column 
purified and sizing column-purified fractions (Fig. 24). SDS-PAGE was performed to verify 
incorporation of cytochrome P450 6B1 into discs (Fig. 23). 

10 The endogenous (natural) ratio of cytochrome P450 to reductase is about 10-20. To 

obtain activity of the cytochrome P450 6B1 after reconstitution into discs, it is preferred that an 
excess of reductase be added to the reconstitution mixture, such that a P450 molecule and 
reductase molecule both partition into a single disc. Supplementation of the microsomal 
preparation with exogenously added reductase has been successfully demonstrated. 

15 The protocol for making discs using microsomal preparations was used with one 

modification. Exogenous rat reductase was added after the solubilization step of the microsomal 
preparation with cholate and before the addition of MSP 1 . Otherwise identical disc assembly and 
purification procedures were followed. The sample was separated by a Superdex sizing column, 
where absorbance at 280 nm indicates the presence of MSP 1, absorbance at 420 and 456 nm 

20 indicates the presence of ferric species, and absorbance at 456 nm also indicates presence of 
reductase. A ratio plot of 456 to 420 nm was made; it showed positions on the chromatogram 
where the absorbance at 456 nm was above that associated with cytochrome P450 6B1 and, 
therefore, could be attributed to absorbance by reductase. Retention times reflected the presence 
of 10 nm particles containing cytochrome P450 6B1 and reductase (Fig. 26). 

25 MSP-supported nanodiscs with purified proteins, membrane fragments or disrupted 

membranes can be used in high throughput screening ventures, for example, to identify new 
pharmaceuticals and other biologically active molecules. 
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Example 9. Integral Membrane Protein Incorporation 

Bacteriorhodopsin is a model integral membrane protein. Bacteriorhodopsin was 
incorporated into nanoscale structures using the following procedure, which is a protocol useful 
for other proteins as well Bacteriorhodopsin was obtained as lyophilized purple membrane from 

5 Sigma (St. Louis, MO). 1 mg BR was suspended in 1 ml 25 mM potassium phosphate pH 6.9. 
1 ml 90 mM n-octyl B-D-glucopyranoside in the same buffer was added and the sample placed 
in the dark at 24 0 C overnight. This treatment produces a detergent-solubilized monomeric form 
(Dencher et al., 1982). BR was quantitated assuming a molar extinction coefficient at 550 run 
of 63,000. BR (7.8 uM) was mixed with MSP1 (97mM)orMSP2(110mM)andcholate(50 

10 mM) to give final molar ratios of MSP1:BR of 10:1 or MSP2:BR of 5:1 and a cholate 
concentration of approximately 8 mM. For reconstitution with phospholipid, the lipid is 
solubilized as above in the presence of 50 mM cholate and mixed with MSP1 at a mole ratio of 
1 MSP1:110 lipids:0.1 BR. The mixture was incubated at room temperature for ~3 hours 
followed by dialysis overnight against 1000 volumes of buffer using 10,000 MW cutoff dialysis 

15 devices (Slide-a-lyzer, Pierce Chemical). Dialysis was continued at 4 degrees for 2 days with 
several changes of buffer. 10 mMHEPES,pH 7.5, 0.15 MNaCl buffer can be used. Tris buffer 
pH 7.5 or pH 8 has also been used successfully. 

The 5-hydroxytryptamine 1A G protein coupled receptor from human has been 
incorporated into MSP-containing nanoparticles. A commercially available insect cell expression 
20 system that provides a membrane fraction containing the human 5-hydroxytryplamine 1AGPCR 
was supported using MSP compositions. Briefly, the 5-HT receptor containing membrane 
preparation was mixed with phospholipids (phosphatidyl choline, phosphatidyiethanolamine, 
phosphatidyl serine) at a ratio of 45 :45 : 1 0, MSP 1 and cholate. ■ 

5-HT1 A receptors overexpressed in a commercially available Sf9 insect cell membrane 
25 preparation (Sigma Chemical Co., St. Louis, MO) were solubilized using the following protocol. 
POPC, POPS and POPE (Avanti Phospholipids) in chloroform were mixed in a 45:10:45 mole 
ratio and dried down under a stream of nitrogen, then placed under vacuum for several hours to 
remove residual solvent. The phospholipids were dispersed in 50 mM Tris pH 7.4, 0.2 MNaCl, 
50 mM sodium cholate buffer at a concentration of 25 mM phospholipid. Five microliters of the 

50 



WO 02/40501 



PCT/US01/43451 



sf9 membrane preparation (0.2 mg/ml protein), 1.62 microliters of phospholipid in buffer, 2.4 
microliters of MSP1 (4.2 mg/ml) and 0.28 microliters 4 MNaCl were mixed and left for 1 hour 
on ice. The mixture was diluted to 100 microliters total volume with 50 mM Tris pH 7.4 and 
dialyzed in a mini slide-a-lyzer (Pierce Chemical) against 50 mM Tris pH 7.4 at 4 °C (two one- 

5 liter changes of buffer). To determine the amount of 5HT1 A receptor associated with nanodisks, 
a radiolabeled ligand was bound to the receptor and disk-receptor-ligand complexes were isolated 
using the 6-histidine tag present in the MSP1 according to the following protocol. After dialysis, 
the mixture was diluted to 200 microliters total volume with 50 mM Tris pH 7.4. Ninety-five 
microliters of the diluted mixture were placed into each of two tubes. One hundred five 

10 microliters of stock reagent were added to give final concentrations of 50 mM Tris pH 7.4, 10 
mM MgS04, 0.5 mM EDTA, 0.1% ascorbic acid in a final volume of 200 microliters. Tritium- 
labeled 8-hydroxy-DPAT (specific activity 135000 Ci/mole) was added to each tube to give a 
concentration of 1.5 nM. As a control, unlabeled metergoline (final concentration 100 
micromolar) was added to one of the tubes as a competitive ligand. After 1 hour on ice, the 

15 mixture was applied to 200 microliters of Ni-chelating resin to specifically bind receptor 
associated with 6Histidine-tagged MSP1 disks. The resin was washed three times with 0.5 ml 
of cold 50 mM Tris pH 7.4 to remove non-specifically bound ligand. Specifically bound 
radiolabeled 8-hydroxy-DPAT bound to receptor/disk complexes was eluted with 0.5 ml 0.5 
Molar imidazole in 1 0 mM Tris pH 7.4, 0.5 M NaCl. Scintillation cocktail was mixed with the 

20 eluate and specifically bound radioligand was determined by scintillation counting. Between five 
and fifteen percent of the receptor initially present in the sf9 membrane was found to be 
associated with MSP1 nanodisks. 

The particles into which the 5-HT GPCR had incorporated were dialyzed. Functionality 
(in terms of ligand binding) was tested using dialysis against buffer containing tritiated 8-OH- 
25 DP AT, an agonist of this receptor. The particles were then run over a Ni-NTA column to bind 
via the histidine tag on the MSP1 and to separate the particles from 8-OH-DPAT which had not 
bound to the particles, and the material bound to the column was then eluted. Association of the 
tritium labeled agonist was demonstrated, showing that the incorporated GPCR retained its 
ability to bind agonist. 
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Example 1 0. Analysis of MSP-supported Nanodisc Phospholipid Assemblies 

The particles resulting from self assembly of membrane scaffold proteins and 
phospholipids, either with or without an additional target protein, were analyzed as follows. 

Bacteriorhodopsin-containing particles were dialyzed, and the resulting mixture was 
5 injected onto a Superdex 200 HR10/30 gel filtration column (Pharmacia) and eluted with buffer 
at 0.5 ml/min at room temperature. Absorbance was monitored at 280 am for protein and 550 
nm for BR. 0.5 ml fractions were collected. The column was calibrated using a mixture of 
thyroglobulin (669 kDa, Stake's diameter 170 A), ferritin (440 kDa, Stake's diameter 122 A), 
catalase (232 kDa, Stake's diameter 92 A), lactate dehydrogenase (140 kDa, Stake's diameter 
10 82 A), bovine serum albumin (66 kDa, Stake's diameter 71 A), and horse heart cytochrome c 
(12.4 kDa, Stake's diameter 35.6 A). 

Atomic Force Microscopy (AFM) was performed with a Digital Instruments Nanoscope 
Ilia in contact mode with sharpened silicon nitride probes under buffer. MSP1 and MSP2 
dipalmitoyl phosphatidylcholine particles were treated with 1 :50 Factor Xa:MSP protein by mass 
15 in 10 mM Tris pH 8, 0.15 M NaCl, 2 mM CaCl 2 for 8 hours. 2-10 ml sample was placed on a 
freshly cleaved mica surface along with 20 ml imaging buffer (10 mM Tris pH 8, 0. 1 5 M NaCl, 
1 0 mM MgCy and incubated for 30 minutes or longer before mounting sample in the fluid cell. 
Several milliliters of buffer were flushed through the fluid cell to remove unadsorbed material. 

Phosphate analysis of the nanoscale particles was carried out as follows. Phosphate assay 
20 procedures were adapted from Chen et al. (1956) Anal. Chem. 28:1756-1758 and Fiske and 
Subbarow (1 925). Samples containing roughly 40 nmoles lipid phosphate were dried down in 
glass tubes. 75 ml 8.9 NH 2 S0 4 was added to each tube and heated to 210°C for 30 minutes. 1 
drop 30% H 2 0 2 was added to each tube and heated for 30 minutes. Tubes were cooled, 0.65 ml 
H 2 0 was added followed by 83.3 ml 2.5% w/v ammonium molybdate tetrahydrate followed by 
25 vortexing and the addition of 83.3 ml 10% w/v ascorbic acid. After mixing, the tubes were 
placed in a boiling water bath for 7 minutes. Absorbance was read at 820 nm. Absorbance was 
calibrated using potassium phosphate standards from 0 to 100 nmol phosphate. Buffer blanks 
from column chromatography were included for MSP proteins. 
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Example 12. MSP-supported Structures on Surfaces 

Nanodiscs comprising MSPs and a protein of interest can be assembled onto a gold 
surface. The utility of this relates to the resulting epitaxial presentation of a target incorporated 
into a nanodisc assembly to the solution. This offers an ideal system for quantitating binding of 
5 other macromolecules or small molecules tagged with dielectric contrast agents to the target 
protein. A common methods of accomplishing such measurements uses surface plasmon 
resonance (SPR) technology. SPR is a common technique used to monitor biomolecular 
interactions at surfaces. The ability of SPR to rapidly detect and quantitate unlabeled protein 
interactions on gold surfaces is useful for creating high through put chip assays for diverse 
10 membrane proteins (embedded and solubilized) on discs. 

Discs consisting of the phospholipid DPPC either with or without an additional thiolated 
lipid and MSP1 protein were prepared as follows. A 25 mM lipid mixture containing 
phosphatidylcholine was solubilized with 50 mMcholate in 10 mMTris CI, ISOmMNaClatpH 
8.0 were combined and incubated overnight at 37°C. For thiolated discs, 90% 

15 phosphatidylcholine and 10% thiolated lipid (ATA-TEG-DSPA, Northern Lipids) was 
solubilized in 3 .3 mM Tris CI, 66.7 mM borate, 1 50 mM NaCl at pH 9.0 in order to unmask the 
thiols in the thiolated lipids. MSP1 and lipid (1:100) were combined and incubated overnight 
at 37°C. The sample was then dialyzed at 37°C (10,000 MW cutoff membrane) against buffer 
containing 10 mM Tris CI, 150 mM NaCl at pH 8.0 without cholate for 2 hours. Dialysis was 

20 then continued at 4°C for an additional 6 hours with buffer changes every 2 hours. The 
approximately 1 ml sample was concentrated to <250 \xl using a YM-10 centrifuge concentrator 
and injected onto a Pharmacia 10/30 Superdex 200 HR gel filtration column. Samples were 
eluted from the column using the stated buffer without cholate at flow rates of 0.5 ml/min. 
Fractions from chromatography were analyzed by polyacrylamide gel electrophoresis using 8- 

25 25% gradient polyacrylamide gel to determine apparent size. 

The nanodisc samples (3-20 \xM) prepared as described were injected into an SPR 
instrument to determine if the discs would bind to the gold surface. Both the DPPC and 10% 
thiolated lipid discs adsorbed to a gold surface and a modified gold surface covered with a 
monolayer of methyl terminated thiol (nonanethiol) or carboxyl terminated thiol (11- 
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mercaptoundecanoic acid). Thiolated discs were injected using a buffer consisting of 3.3 mM 
Tris, 66.7 mM borate, 150 mM NaCl, pH 9.0. DPPC discs were injected using a buffer of 10 
mM Tris, 150 mM NaCl, pH 7.5 or pH 8.0. In all cases, the discs could not be removed even 
under harsh conditions (0.5 M HC1). Surface coverage was shown to increase with increasing 

5 concentration of discs injected (3 [iM vs. 19 jxM). Discs do not form perfectly packed 
monolayers; accordingly, surface coverage is limited by the jamming limit (theoretical maximum 
coverage based on random sequential absorption to the surface modeling discs as identical non- 
overlapping hard spheres) of 0.547. The coverage for a full monolayer of discs was calculated 
based on an assumption of disc height of 5.5 nm and a refractive index between 1.45 and 1.5. 

10 The full monolayer values were multiplied by the jamming limit to determine the maximum 
coverage that was then used to determine percent coverage based on experimental values. When 
the disc concentration was at least 10 ^M, the estimated coverages were between about 62 and 
about 103%. The resultant SPR trace demonstrating association of the nanodiscs to the gold 
surface is shown in Fig. 27. 
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WHAT IS CLAIMED IS: 

1 . A membrane scaffold protein that will, in an aqueous environment, self assemble in the 
absence of phospholipid or with a phospholipid or a mixture of phospholipids, into a 
nanoscale particle between about 5 nm and about 500 mn in diameter, wherein said 
membrane scaffold protein is amphipathic, and wherein said membrane scaffold protein 
forms at least one alpha helix. 

2. The membrane scaffold protein of claim 1, wherein said membrane scaffold protein 
assembles with a phospholipid or a mixture of phospholipids into a nanoscale particle of 
about 5 nm and about 500 nm in diameter, wherein a phospholipid bilayer is formed. 

3. The membrane scaffold protein of claim 2, wherein the phospholipid bilayer is discoidal. 

4. The membrane scaffold protein of claim 1 , wherein said membrane scaffold protein self 
assembles together with at least one hydrophobic or partially hydrophobic protein to form 
a nanoscale particle between about 5 nm and 500 nm in diameter, said nanoscale particles 
comprising the membrane scaffold protein and the at least one hydrophobic or partially 
hydrophobic protein. 

5. The membrane scaffold protein of any of claims 1 to 4, wherein said membrane scaffold 
protein self assembles in the absence of phospholipid to form a nanoscale particle 
between about 5 nm and about 500 nm in diameter. 

6. The membrane scaffold protein of claim 5, wherein said nanoscale particle is fiom about 
5 to about 100 nm in diameter. 

7. The membrane scaffold protein of claim 6, wherein said nanoscale particle is from about 
5 to about 50 nm in diameter. 
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9. 

10. 

11. 

12. 

13.. 

14. 

15. 
16. 



The membrane scaffold protein of any of claims 1 to 7, wherein said membrane scaffold 
protein comprises an amino acid sequence selected from the group consisting of SEQ ID 
NO:6, SEQ ED NO:9, SEQ ID NO: 17, SEQ ID NO: 19, SEQ ID NO:23, SEQ ID NO:29, 
SEQ ID NO:43, SEQ ID NO:44 and SEQ ID NO:45. 

A nanoscale particle comprising the membrane scaffold protein of any of claims 1 to 8 
and at least one hydrophobic or partially hydrophobic protein, and optionally further 
comprising a phospholipid or a mixture of phospholipids, wherein said nanoscale particle 
has a diameter between about 5 nm and about 500 nm. 

The nanoscale particle of claim 9, wherein the hydrophobic or partially hydrophobic 
protein is a membrane protein. 

The nanoscale particle of claim 10, wherein said membrane protein is a tethered 
membrane protein. 

The nanoscale particle of claim 10, wherein the membrane protein is an embedded 
membrane protein. 

The nanoparticle assembly of claim 10, wherein the membrane protein is an integral 
membrane protein. 

The nanoscale particle of claim 13, wherein the membrane protein has seven 
transmembrane segments. 

The nanoscale particle of claim 1 0, wherein said membrane protein is a receptor protein. 

The nanoscale particle of claim 10, wherein said membrane protein is a G-protein 
coupled receptor. 
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17. 
18. 
19. 

20. 



21. 

22. 

23. 

24. 

25. 
26. 



The nanoscale particle of claim 16, wherein said G-protein coupled receptor is a 5- 
hydroxytryptamine receptor. 

The nanoscale particle of claim 8, wherein said membrane scaffold protein is fused 
genetically with the hydrophobic protein. 

The nanoscale particle of any of claims 9 to 1 8, wherein said membrane scaffold protein 
comprises an amino acid sequence selected from the group consisting of SEQ ID NO:6, 
SEQ ID NO:9, SEQ ID NO:17, SEQ ID NO:19, SEQ ID NO:23, SEQ ID NO:29, SEQ 
ID NO:43, SEQ ID NO:44 and SEQ ID NO:45. 

A method for incorporating at least one hydrophobic or partially hydrophobic protein into 
a nanoscale particle which is stable and soluble in aqueous solutions, said method 
comprising the step of allowing a membrane scaffold protein and at least one 
hydrophobic or partially hydrophobic protein to self assemble into nanoscale particles in 
an aqueous solution, optionally in the presence of at least one phospholipid, whereby 
nanoscale particles are formed. 

The method of claim 20, wherein said at least one hydrophobic or partially hydrophobic 
protein is a membrane protein. 

The method of claim 21 , wherein said membrane protein is a tethered membrane protein, 
an embedded membrane protein or an integral membrane protein. 

The method of claim 22, wherein said membrane protein is tissue factor. 

The method of claim 21, wherein said membrane protein is a receptor protein. 

The method of claim 24, wherein said receptor protein is a G-protein coupled receptor. 
The method of claim 25, wherein said G-protein coupled receptor is a 5- 
hydroxytryptamine receptor. 
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27. The method of any of claims 20 to 26, wherein said membrane scaffold protein comprises 
an amino acid sequence selected from the group consisting of SEQ ID NO:6, SEQ ID 
NO:9, SEQ ID NO: 17, SEQ ED NO: 1 9, SEQ ID NO:23, SEQ ID NO :29, SEQ ID NO:43, 
SEQ ID NO:44 and SEQ ED NO:45. 

28. The method of any of claims 20 to 27, wherein said at least one hydrophobic or partially 
hydrophobic protein is associated with membranes or membrane fragments. 

30. A method for identifying an competitor of binding of a ligand to a receptor protein 
wherein said receptor protein is incorporated within a nanoscale particle together with a 
membrane scaffold protein, said method comprising the steps of: 

(a) contacting a nanoscale particle comprising a membrane scaffold protein and a 
receptor protein with a detectable ligand to produce nanoscale particle-bound 
detectable ligand; 

(b) contacting the nanoscale particle-bound ligand with a test compound; 

(c) measuring detectable ligand released from the nanoscale particles; 

whereby a competitor of ligand binding is identified when contacting the nanoscale 
particle-bound ligand results in release of the detectable ligand. 

3 1 . The method of claim 3 0, wherein said receptor protein is a membrane protein. 

32. The method of claim 3 1 , wherein said receptor protein is a G-protein coupled receptor. 

33. The method of claim 32, wherein said G-protein coupled receptor is a 5- 
hydroxytryptamine receptor. 

34. The method of any of claims 30 to 33, wherein said membrane scaffold protein comprises 
an amino acid sequence selected from the group consisting of SEQ ID NO:6, SEQ ED 
NO:9, SEQ ID NO: 17, SEQ ID NO: 19, SEQIDNO^ SEQIDNO:29, SEQIDNO:43, 
SEQ ID NO:44 and SEQ ID NO:45. 
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35. A DNA molecule encoding a membrane scaffold protein, wherein said membrane 
scaffold protein comprises an amino acid sequence selected from the group consisting of 
SEQ ID NO:6, SEQ ID NO:9, SEQ ID NO: 1 7, SEQ ID NO: 19, SEQ ID NO:23, SEQ ID 
NO:29, SEQ ED NO:43, SEQ ID NO:44 and SEQ ID NO:45. 

36. A recombinant host cell comprising the DNA molecule of claim 35. 
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SEQUENCE LISTING 



<110> BOARD OP TRUSTEES OF THE UNIVERSITY OF ILLINOIS 



<120> Membrane Scaffold Proteins 

<130> 87-00 WO 

<140> Not assigned 
<141> 2001-11-20 

<150> US 60/252,233 
<151> 2000-11,-20 

<160> 46 

<170> Patentln Ver\ 2.0 

<210> 1 
<211> 762 
<212> DNA 

<213> Homo sapiens 
<220> 

<221> misc_feature 
<222> (1) . . (762)* 

<223> Restriction sites, Nco I and Hind III, are at 5 ! 
and 3 1 termini . 

<400> 1 

ccatggccca tttctggcag caagatgaac ccccccagag cccctgggat cgagtgaagg 60 
acctggccac tgtgtacgtg gatgtgctca aagacagcgg cagagactat gtgtcccagt 120 
ttgaaggctc cgccttggga aaacagctaa acctaaagct ccttgacaac tgggacagcg 180 
tgacctccac cttcagcaag ctgcgcgaac agctcggccc tgtgacccag gagttctggg 240 
ataacctgga aaaggagaca gagggcctga ggcaagagat gagcaaggat ctggaggagg 300 
tgaaggccaa ggtgcagccc tacctggacg acttccagaa gaagtggcag gaggagatgg 360 
agctctaccg ccagaaggtg gagccgctgc gcgcagagct ccaagagggc gcgcgccaga 420 
agctgcacga gctgcaagag aagctgagcc cactgggcga ggagatgcgc gaccgcgcgc 48 0 
gcgcccatgt ggacgcgctg cgcacgcatc tggcccccta cagcgacgag ctgcgccagc 540 
gcttggccgc gcgccttgag gctctcaagg agaacggcgg cgccagactg gccgagtacc 600 
acgccaaggc caccgagcat ctgagcacgc tcagcgagaa ggccaagccc gcgctcgagg 660 
acctccgcca aggcctgctg cccgtgctgg agagcttcaa ggtcagcttc ctgagcgctc 720 
tcgaggagta cactaagaag ctcaacaccc agtaataagc tt 762 

<210> 2 
<211> 250 
<212> PRT 

<213> Homo sapiens 
<400> 2 

Met Ala His Phe Trp Gin Gin Asp Glu Pro Pro Gin Ser Pro Trp Asp 
1 5 10 " 15 
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Arg Val Lys Asp Leu Ala Thr Val Tyr Val Asp Val Leu Lys Asp Ser 
20 25 30 

Gly Arg Asp Tyr Val Ser Gin Phe Glu Gly Ser Ala Leu Gly Lys Gin 
35 40 45 

Leu Asn Leu Lys Leu Leu Asp Asn Trp Asp Ser Val Thr Ser Thr Phe 
50 55 60 

Ser Lys Leu Arg Glu Gin Leu Gly Pro Val Thr Gin Glu Phe Trp Asp 
65 70 75 80 

Asn Leu Glu Lys Glu Thr Glu Gly Leu Arg Gin Glu Met Ser Lys Asp 

85 90 ' 95 

Leu Glu Glu Val Lys Ala Lys Val Gin Pro Tyr Leu Asp Asp Phe Gin 
100 105 110 

Lys Lys Trp Gin Glu Glu Met Glu Leu Tyr Arg Gin Lys Val Glu Pro 
115 120 125 - 

Leu Arg Ala Glu Leu Gin Glu Gly Ala Arg Gin Lys Leu His Glu Leu 
130 135 140 

Gin Glu Lys Leu Ser Pro Leu Gly Glu Glu Met Arg Asp Arg Ala Arg 
145 150 155 160 

Ala His Val Asp Ala Leu Arg Thr His Leu Ala Pro Tyr Ser Asp Glu 

165 170 175 

Leu Arg Gin Arg Leu Ala Ala Arg Leu Glu Ala Leu Lys Glu Asn Gly 
180 185 190 

Gly Ala Arg Leu Ala Glu Tyr His Ala Lys Ala Thr Glu His Leu Ser 
195 200 205 

Thr Leu Ser Glu Lys Ala Lys Pro Ala Leu Glu Asp Leu Arg Gin Gly 
210 215 ' 220 

Leu Leu Pro Val Leu Glu Ser Phe Lys Val Ser Phe Leu Ser Ala Leu 
225 230 235 240 

Glu Glu Tyr Thr Lys Lys Leu Asn Thr Gin 

245 250 



<210> 3 
<211> 61 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: 
Oligonucleotide primer 
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<400> 3 

tataccatgg gccatcatca tcatcatcat atagaaggaa gactaaagct ccttgacaac 60 
t 61 

<210> 4 
<211> 30 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: 
Oligonucleotide primer 



<400> 4 

gcaagcttat tactgggtgt tgagcttctt 

<210> 5 
<211> 654 



30 



<212> DNA - . _ 

<213> Artificial Sequence 

<220> 

<223> Description of Artificial Sequence: Nucleotide 
sequence encoding Synthetic peptide. 

<400> 5 * 

tataccatgg gccatcatca tcatcatcat atagaaggaa gactaaagct ccttgacaac 60 
tgggacagcg tgacctccac cttcagcaag ctgcgcgaac agctcggccc tgtgacccag 120 
gagttctggg ataacctgga aaaggagaca gagggcctga ggcaggagat gagcaaggat 180 
ctggaggagg tgaaggccaa ggtgcagccc tacctggacg acttccagaa gaagtggcag 240 
gaggagatgg agctctaccg ccagaaggtg gagccgctgc gcgcagagct ccaagagggc 300 
gcgcgccaga agctgcacga gctgcaagag aagttgagcc cactgggcga ggagatgcgc 360 
gaccgcgcgc gcgcccatgt ggacgcgctg cgcacgcatc tggcccccta cagcgacgag 420 
ctgcgccagc gcttggccgc gcgccttgag gctctcaagg agaacggcgg cgccagactg 480 
gccgagtacc acgccaaggc caccgagcat ctgagcacgc tcagcgagaa ggccaaaccc 540 
gcgctcgagg acctccgcca aggcctgctg cccgtgctgg agagcttcaa ggtcagcttc 600 
ctgagcgctc tcgaggagta cactaagaag ctcaacaccc agtaataagc ttgc 654 

<210> 6 
<211> 212 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Synthetic 
peptide 

<400> 6 

Met Gly His His His His His His lie Glu Gly Arg Leu Lys Leu Leu 
1 5 10 15 



Asp Asn Trp Asp Ser Val Thr Ser Thr Phe Ser Lys Leu Arg Glu Gin 
20 25 30 
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Leu Gly Pro Val Thr Gin Glu Phe Trp Asp Asn Leu Glu Lys Glu Thr 
35 40 45 

Glu Gly Leu Arg Gin Glu Met Ser Lys Asp Leu Glu Glu Val Lys Ala 
50 55 60 

Lys Val Gin Pro Tyr Leu Asp Asp Phe Gin Lys Lys Trp Gin Glu Glu 
65 70 75 80 

Met Glu Leu Tyr Arg Gin Lys Val Glu Pro Leu Arg Ala Glu Leu Gin 

85 90 95 

Glu Gly Ala Arg Gin Lys Leu His Glu Leu Gin Glu Lys Leu Ser Pro 
100 105 110 

Leu Gly Glu Glu Met Arg Asp Arg Ala Arg Ala His Val Asp Ala Leu 
115 120 125 

Arg Thr His Leu Ala Pro Tyr Ser Asp Glu Leu Arg Gin Arg Leu Ala 
130 135 140 

Ala Arg Leu Glu Ala Leu Lys Glu Asn Gly Gly Ala Arg Leu Ala Glu 
145 150 155 160 

Tyr His Ala Lys Ala Thr Glu His Leu Ser Thr Leu Ser Glu Lys Ala 

165 170 175 

Lys Pro Ala Leu Glu Asp Leu Arg Gin Gly Leu Leu Pro Val Leu Glu 
180 185 190 

Ser Phe Lys Val Ser Phe Leu Ser Ala Leu Glu Glu Tyr Thr Lys Lys 
195 200 205 



Leu Asn Thr Gin 
210 



<210> 7 

<211> 27 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: 
Oligonucleotide primer 

<400> 7 

taccatggca aagctccttg acaactg 

<210> 8 
<211> 619 
<212> DHA 

<213> Artificial Sequence 
<220> 



4 
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<223> Description of Artificial Sequence; 
Oligonucleotide primer 



<400> 8 

taccatggca aagctccttg acaactggga 
cgaacagctc ggccctgtga cccaggagtt 
cctgaggcag gagatgagca aggatctgga 
ggacgacttc cagaagaagt ggcaggagga 
gctgcgcgca gagctccaag agggcgcgcg 
gagcccactg ggcgaggaga tgcgcgaccg 
gcatctggcc ccctacagcg acgagctgcg 
caaggagaac ggcggcgcca gactggccga 
cacgctcagc gagaaggcca aacccgcgct 
.gctggagagc ttcaaggtca gcttcctgag 
cacccagtaa taagcttgc 



cagcgtgacc tccaccttca gcaagctgcg 60 
ctgggataac ctggaaaagg agacagaggg 120 
ggaggtgaag gccaaggtgc agccctacct 180 
gatggagctc taccgccaga aggtggagcc 240 
ccagaagctg cacgagctgc aagagaagtt 3 00 
cgcgcgcgcc catgtggacg cgctgcgcac 360 
ccagcgcttg gccgcgcgcc ttgaggctct 420 
gtaccacgcc aaggccaccg agcatctgag 480 
cgaggacctc cgccaaggcc tgctgcccgt 540 
cgctctcgag gagtacacta agaagctcaa 600 

619 



<210> 9 

<211> 201 

<212> PRT r 

<213> Artificial Sequence 



<220> 

<223> Description of Artificial Sequence: Synthetic 
peptide 

<400> 9 

Met Ala Lys. Leu Leu Asp Asn Trp Asp Ser Val Thr Ser Thr Phe Ser 
15 10 15 

Lys Leu Arg Glu Gin Leu Gly Pro Val Thr Gin Glu Phe Trp Asp Asn 
20 25 30 

Leu Glu Lys Glu Thr Glu Gly Leu Arg Gin Glu Met Ser Lys Asp Leu 
35 40 45 

Glu Glu Val Lys Ala Lys Val Gin Pro Tyr Leu Asp Asp Phe Gin Lys 
50 55 60 

Lys Trp Gin Glu Glu Met Glu Leu Tyr Arg Gin Lys Val Glu Pro Leu 
65 70 75 80 

Arg Ala Glu Leu Gin Glu Gly Ala Arg Gin Lys Leu His Glu Leu Gin 

85 90 95 

Glu Lys Leu Ser Pro Leu Gly Glu Glu Met Arg Asp Arg Ala Arg Ala 
• 100 105 HO 

His Val Asp Ala Leu Arg Thr His Leu Ala Pro Tyr Ser Asp Glu Leu 
115 120 125 

Arg Gin Arg Leu Ala Ala Arg Leu Glu Ala Leu Lys Glu Asn Gly Gly 
130 135 140 

Ala Arg Leu Ala Glu Tyr His Ala Lys Ala Thr Glu His Leu Ser Thr 
145 150 155 ISO 
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Leu Ser Glu Lys Ala Lys Pro Ala Leu Glu Asp Leu Arg Gin Gly Leu 

165 170 175 

Leu Pro Val Leu Glu Ser Phe Lys Val Ser Phe Leu Ser Ala Leu Glu 
180 185 190 

Glu Tyr Thr Lys Lys Leu Asn Thr Gin 
195 200 



<210> 10 

<211> 27 

<212> DNA 

<213> Artificial 



Sequence 



<220> 

<223> Description of Artificial Sequence: 
Oligonucleotide primer 

<400> 10 

taccatggca aagctccttg acaactg 27 

<210> 11 
<211> 61 
<212> DNA 

<213> Artificial Sequence 



<220> 

<223> Description of Artificial Sequence: 
Oligonucleotide primer 

<400> 11 

tataccatgg gccatcatca tcatcatcat atagaaggaa gactaaagct ccttgacaac 60 
t 51 

<210> 12 
<211> 52 
<212> DNA 

<213> Artificial Sequence 



<220> 

<223> Description of Artificial Sequence: 
Oligonucleotide primer 

<400> 12 

taagaagctc aacacccagg gtaccggtgg aggtagtgga ggtggtaccc ta 

<210> 13 
<2ll> 50 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: 
Oligonucleotide primer 
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<400> 13 

cagggtaccg gtggaggtag tggaggtggt accctaaagc tccttgacaa 



50 



<210> 14 
<211> 30 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: 
Oligonucleotide primer 

<400> 14 

gcaagcttat tactgggtgt tgagcttctt 30 

<210> 15 
<211> 10 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Synthetic 
peptide 

<400> 15 

Gly Thx Gly Gly Gly Ser Gly Gly Gly Thr 
15 10 



<210> 16 
<211> 1260 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Nucleotide 
sequence encoding a synthetic peptide. 

<400> 16 

tataccatgg gccatcatca tcatcatcat atagaaggaa gactaaagct ccttgacaac 60 
tgggacagcg tgacctccac cttcagcaag ctgcgcgaac agctcggccc tgtgacccag 120 
gagttctggg ataacctgga aaaggagaca gagggcctga ggcaggagat gagcaaggat 180 
ctggaggagg tgaaggccaa ggtgcagccc tacctggacg acttccagaa gaagtggcag 240 
gaggagatgg agctctaccg ccagaaggtg gagccgctgc gcgcagagct ccaagagggc 300 
gcgcgccaga agctgcacga gctgcaagag aagctgagcc cactgggcga ggagatgcgc 360 
gaccgcgcgc gcgcccatgt ggacgcgctg cgcacgcatc tggcccccta cagcgacgag 420 
ctgcgccagc gct€ggccgc gcgccttgag gctctcaagg agaacggcgg cgccagactg 480 
gccgagtacc acgccaaggc caccgagcat ctgagcacgc tcagcgagaa ggccaagccc 540 
gcgctcgagg acctccgcca aggcctgctg cccgtgctgg agagcttcaa ggtcagcttc 600 
ctgagcgctc tcgaggagta cactaagaag ctcaacaccc agggtaccct aaagctcctt 660 
gacaactggg acagcgtgac ctccaccttc agcaagctgc gcgaacagct cggccctgtg 720 
acccaggagt tctgggataa cctggaaaag gagacagagg gcctgaggca ggagatgagc 780 
aaggatctgg aggaggtgaa ggccaaggtg cagccctacc tggacgactt ccagaagaag 840 
tggcaggagg agatggagct ctaccgccag aaggtggagc cgctgcgcgc agagctccaa 900 
gagggcgcgc gccagaagct gcacgagctg caagagaagc tgagcccact gggcgaggag 960 
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atgcgcgacc gcgcgcgcgc ccatgtggac gcgctgcgca cgcatctggc cccctacagc 102 0 
gacgagctgc gccagcgctt ggccgcgcgc cttgaggctc tcaaggagaa cggcggcgcc 1080 
agactggccg agtaccacgc caaggccacc gagcatctga gcacgctcag cgagaaggcc 1140 
aagcccgcgc tcgaggacct ccgccaaggc ctgctgcccg tgctggagag cttcaaggtc 1200 
agcttcctga gcgctctcga ggagtacact aagaagctca acacccagta ataagcttgc 1260 

<210> 17 
<211> 414 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Synthetic 
peptide 

<400> 17 

Met Gly His His His His His His He Glu Gly Arg Leu Lys Leu Leu 
1 5- 10 15 

Asp Asn Trp Asp Ser Val Thr Ser Thr Phe Ser Lys Leu Arg Glu Gin 
20 25 30 

Leu Gly Pro Val Thr Gin Glu Phe Trp Asp Asn Leu Glu Lys Glu Thr 
35 40 45 

Glu Gly Leu Arg Gin Glu Met Ser Lys Asp Leu Glu Glu Val Lys Ala 
'50 55 60 

Lys Val Gin Pro Tyr Leu Asp Asp Phe Gin Lys Lys Trp Gin Glu Glu 
65 70 75 80 

Met Glu Leu Tyr Arg Gin Lys Val Glu Pro Leu Arg Ala Glu Leu Gin 

85 90 95 

Glu Gly Ala Arg Gin Lys Leu His Glu Leu Gin Glu Lys Leu Ser Pro 
100 105 110 

Leu Gly Glu Glu Met Arg Asp Arg Ala Arg Ala His Val Asp Ala Leu 
115 120 125 

Arg Thr His Leu Ala Pro Tyr Ser Asp Glu Leu Arg Gin Arg Leu Ala 
130 135 140 

Ala Arg Leu Glu Ala Leu Lys Glu Asn Gly Gly Ala Arg Leu Ala Glu 
145 150 155 160 

Tyr His Ala Lys Ala Thr Glu His Leu Ser Thr Leu Ser Glu Lys Ala 

165 170 175 

Lys Pro Ala Leu Glu Asp Leu Arg Gin Gly Leu Leu Pro Val Leu Glu 
180 185 190 

Ser Phe Lys Val Ser Phe Leu Ser Ala Leu Glu Glu Tyr Thr Lys Lys 
195 200 205 
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Leu Asn Thr Gin Gly Thr Leu Lys Leu Leu Asp Asn Trp Asp Ser Val 
210 215 220' 

Thr Ser Thr Phe Ser Lys Leu Arg Glu Gin Leu Gly Pro Val Thr Gin 
225 230 235 240 

Glu Phe Trp Asp Asn Leu Glu Lys Glu Thr Glu Gly Leu Arg Gin Glu 

245 250 255 

Met Ser Lys Asp Leu Glu Glu Val Lys Ala Lys Val Gin Pro Tyr Leu 
260 265 270 

Asp Asp Phe Gin Lys Lys Trp Gin Glu Glu Met Glu Leu Tyr Arg Gin 
275 280 285 

Lys Val Glu Pro Leu Arg Ala Glu Leu Gin Glu Gly Ala Arg Gin Lys 
290 295 300 

Leu His Glu Leu Gin Glu Lys Leu Ser Pro Leu Gly Glu Glu Met Arg 
305 310 315 320 

Asp Arg Ala Arg Ala His Val Asp Ala Leu Arg Thr His Leu Ala Pro 

325 330 335 

Tyr Ser Asp Glu Leu Arg Gin Arg Leu Ala Ala Arg Leu Glu Ala Leu 
340 345 350 

Lys Glu Asn Gly Gly Ala Arg Leu Ala Glu Tyr His Ala Lys Ala Thr 
355 360 365 

Glu His Leu Ser Thr Leu Ser Glu Lys Ala Lys Pro Ala Leu Glu Asp 
370 375 380 

Leu Arg Gin Gly Leu Leu Pro Val Leu Glu Ser Phe Lys Val Ser Phe 
385 390 395 400 

Leu Ser Ala Leu Glu Glu Tyr Thr Lys Lys Leu Asn Thr ( Gin 

405 410 



<210> 18 
<211> 1282 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Synthetic 
nucleotide sequence encoding an artificial 
protein. 

<400> 18 

taccatgggc catcatcatc atcatcatat agaaggaaga ctaaagctcc ttgacaactg 60 
ggacagcgtg acctccacct tcagcaagct gcgcgaacag ctcggccctg tgacccagga 120 
gttctgggat aacctggaaa aggagacaga gggcctgagg caggagatga gcaaggatct 180 
Sfgaggaggtg aaggccaagg tgcagcccta cctggacgac ttccagaaga agtggcagga 240 
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ggagatggag ctctaccgcc agaaggtgga gccgctgcgc gcagagctcc aagagggcgc 300 
gcgccagaag ctgcacgagc tgcaagagaa gctgagccca ctgggcgagg agatgcgcga 360 
ccgcgcgcgc gcccatgtgg acgcgctgcg cacgcatctg gccccctaca gcgacgagct 420 
gcgccagcgc ttggccgcgc gccttgaggc tctcaaggag aacggcggcg ccagactggc 480 
cgagtaccac gccaaggcca ccgagcatct gagcacgctc agcgagaagg ccaagcccgc 540 
gctcgaggac ctccgccaag gcctgctgcc cgtgctggag agcttcaagg tcagcttcct 6.00 
gagcgctctc gaggagtaca ctaagaagct caacacccag ggtaccggtg gaggtagtgg 660 
aggtggtacc ctaaagctcc ttgacaactg ggacagcgtg acctccacct tcagcaagct 720 
gcgcgaacag ctcggccctg tgacccagga gttctgggat aacctggaaa aggagacaga 780 
gggcctgagg caggagatga gcaaggatct ggaggaggtg aaggccaagg tgcagcccta 840 
cctggacgac ttccagaaga agtggcagga ggagatggag ctctaccgcc agaaggtgga 900 
gccgctgcgc gcagagctcc aagagggcgc gcgccagaag ctgcacgagc tgcaagagaa 960 
gctgagccca ctgggcgagg agatgcgcga ccgcgcgcgc gcccatgtgg acgcgctgcg 1020 
cacgcatctg gccccctaca gcgacgagct gcgccagcgc ttggccgcgc gccttgaggc 1080 
tctcaaggag aacggcggcg ccagactggc cgagtaccac gccaaggcca ccgagcatct 1140 
gagcacgctc agcgagaagg ccaagcccgc gctcgaggac ctccgccaag gcctgctgcc 1200 
cgtgctggag agcttcaagg tcagcttcct gagcgctctc gaggagtaca ctaagaagct 1260 
caacacccag taataagctt gc 1282 

<210> 19 - 
<211> 422 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Synthetic 
peptide 

<400> 19 

Met Gly His His His His His His He Glu Gly Arg Leu Lys Leu Leu 
15 10 15 

Asp Asn Trp Asp Ser Val Thr Ser Thr Phe Ser Lys Leu Arg Glu Gin 
20 25 30 

Leu Gly Pro Val Thr Gin Glu Phe Trp Asp Asn Leu Glu Lys Glu Thr 
35 40 45 

Glu Gly Leu Arg Gin Glu Met Ser Lys Asp Leu Glu Glu Val Lys Ala 
50 55 60 

Lys Val Gin Pro Tyr Leu Asp Asp Phe Gin Lys Lys Trp Gin Glu Glu 
65 70 75 80 

Met Glu Leu Tyr Arg Gin Lys Val Glu Pro Leu Arg Ala Glu Leu Gin 

85 90 95 

Glu Gly Ala Arg Gin Lys Leu His Glu Leu Gin Glu Lys Leu Ser Pro 
100 105 110 

Leu Gly Glu Glu Met Arg Asp Arg Ala Arg Ala His Val Asp Ala Leu 
115 120 125 

Arg Thr His Leu Ala Pro Tyr Ser Asp Glu Leu Arg Gin Arg Leu Ala 
130 135 140 
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Ala Arg Leu Glu Ala Leu Lys Glu Asn Gly Gly Ala Arg Leu Ala Glu 
145 150 155 160 

Tyr His Ala Lys Ala Thr Glu His Leu Ser Thr Leu Ser Glu Lys Ala 

165 170 175 

Lys Pro Ala Leu Glu Asp Leu Arg Gin Gly Leu Leu Pro Val Leu Glu 
180 185 190 

Ser Phe Lys Val Ser Phe Leu Ser Ala Leu Glu Glu Tyr Thr Lys Lys 
195 200 205 

Leu Asn Thr Gin Gly Thr Gly Gly Gly Ser Gly Gly Gly Thr Leu Lys 
210 215 220 



Leu Leu Asp Asn Trp Asp Ser Val Thr Ser Thr Phe Ser Lys Leu Arg 
225 230 235 240 

Glu Gin Leu Gly Pro Val Thr Gin Glu Phe Trp Asp Asn Leu Glu Lys 

245 " 250 255 - 

Glu Thr Glu Gly Leu Arg Gin Glu Met Ser Lys Asp Leu Glu Glu Val 
260 265 270 

Lys Ala Lys Val Gin Pro Tyr Leu Asp Asp Phe Gin Lys Lys Trp Gin 
275 280 285 

Glu Glu Met Glu Leu Tyr Arg Gin Lys Val Glu Pro Leu Arg Ala Glu 
290 295 300 

Leu Gin Glu Gly Ala Arg Gin Lys Leu His Glu Leu Gin Glu Lys Leu 
305 310 315 320 

Ser Pro Leu Gly Glu Glu Met Arg Asp Arg Ala Arg Ala His Val Asp 

325 330 335 

Ala Leu Arg Thr His Leu Ala Pro Tyr Ser Asp Glu Leu Arg Gin Arg * 
340 345 350 

Leu Ala Ala Arg Leu Glu Ala Leu Lys Glu Asn Gly Gly Ala Arg Leu 
355 360 365 

Ala Glu Tyr His Ala Lys Ala Thr Glu His Leu Ser Thr Leu Ser Glu 
370 375 380 

Lys Ala Lys Pro Ala Leu Glu Asp Leu Arg Gin Gly Leu Leu Pro Val 
385 390 395 400 

Leu Glu Ser Phe Lys Val Ser Phe Leu Ser Ala Leu Glu Glu Tyr Thr 

405 410 415 

Lys Lys Leu Asn Thr Gin 
420 
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<210> 20 
<211> 43 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: 
Oligonucleotide primer 

<400> 20 

tggagctcta ccgccagaag gtggagccct acagcgacga get 43 

<210> 21 
<211> 30 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description -of Artificial Sequence: 

Oligonucleotide primer — 



<400> 21 

gcaagcttat tactgggtgt tgagcttctt 

<210> 22 
<211>- 522 
<212> DNA 

<213> Artificial Sequence 



30 



<220> 

<223> Description of Artificial Sequence: Synthetic 
nucleotide sequence encoding an artificial 
peptide. 

<400> 22 

tataccatgg gccatcatca tcatcatcat atagaaggaa gactaaagct ccttgacaac 60 
tgggacagcg tgacctccac cttcagcaag ctgcgcgaac agctcggccc tgtgacccag 120 
gagttctggg ataacctgga aaaggagaca gagggectga ggcaggagat gagcaaggat 180 
ctggaggagg tgaaggccaa ggtgcagccc tacctggacg acttccagaa gaagtggcag 240 
gaggagatgg agctctaccg ccagaaggtg gagccctaca gegacgaget gcgccagcgc 300 
ttggccgcgc gecttgagge tctcaaggag aacggcggcg ccagactggc cgagtaccac 360 
gccaaggcca ccgagcatct gagcacgctc agegagaagg ccaaacccgc gctcgaggac 420 
ctccgccaag gcctgctgcc cgtgctggag agcttcaagg tcagcttcct gagcgctctc 480 
gaggagtaca ctaagaagct caacacccag taataagctt gc 522 

<210> 23 
<211> 168 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Synthetic 
peptide 

<400> 23 
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Met Gly His His His His His His lie Glu Gly Arg Leu Lys Leu Leu 
1 .5 10 15 

Asp Asn Trp Asp Ser Val Thr Ser Thr Phe Ser Lys Leu Arg Glu Gin 
20 25 30 

Leu Gly Pro Val Thr Gin Glu Phe Trp Asp Asn Leu Glu Lys Glu Thr 
35 40 45 

Glu Gly Leu Arg Gin Glu Met Ser Lys Asp Leu Glu Glu Val Lys Ala 
50 55 60 

Lys Val Gin Pro Tyr Leu Asp Asp Phe Gin Lys Lys Trp Gin Glu Glu 
65 70 75 80 

Met Glu Leu Tyr Arg Gin Lys Val Glu Pro Tyr Ser Asp Glu Leu Arg 

85 90 95 

Gin Arg Leu Ala Ala Arg Leu Glu Ala Leu Lys Glu Asn Gly Gly Ala 
100 " 105 110 

Arg Leu Ala Glu Tyr His Ala Lys Ala Thr Glu His Leu 'Ser Thr Leu 
115 120 125 

Ser Glu Lys Ala Lys Pro Ala Leu Glu Asp Leu Arg Gin Gly Leu Leu 
130 135 140 

Pro Val Leu Glu Ser Phe Lys Val Ser Phe Leu Ser Ala Leu Glu Glu 
145 ' 150 155 160 



Tyr Thr Lys Lys Leu Asn Thr Gin ■ 

165 



<210> 24 
<211> 29 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: 
Oligonucleotide primer 

<400> 24 

cagaattcgc tagccgagta ccacgccaa 

<210> 25 
<211> 30 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: 
Oligonucleotide primer . 



13 



WO 02/4050 1 



PCT/USO 1/43451 



<400> 25 

gcaagcttat tactgggtgt tgagcttctt 



30 



<210> 26 

<211> 30 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: 
Oligonucleotide primer 

<400> 26 

ataccatggg ccatcatcat catcatcata 30 

<210> 27 
<211> 33 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: 
Oligonucleotide primer 



<210> 28 
<211> 522 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Synthetic 
nucleotide sequence encoding an artificial 
peptide . 

<400> 28 

tataccatgg gccatcatca tcatcatcat atagaaggaa gactaaagct ccttgacaac 60 
tgggacagcg tgacctccac cttcagcaag ctgcgcgaac agctcggccc tgtgacccag 12 0 
gagttctggg ataacctgga aaaggagaca gagggcctga ggcaggagat gagcaaggat 180 
ctggaggagg tgaaggccaa ggtgcagccc tacctggacg acttccagaa gaagtggcag 240 
gaggagatgg agctctaccg ccagaaggtg gagccgctgc gcgcagagct ccaagagggc 300 
gcgcgccaga agctgcacga gctgcaagag aagttgagcg ccaggctagc cgagtaccac 360 
gccaaggcca ccgagcatct gagcacgctc agcgagaagg ccaaacccgc gctcgaggac 420 
ctccgccaag gcctgctgcc cgtgctggag agcttcaagg tcagcttcct gagcgctctc 480 
gaggagtaca ctaagaagct caacacccag taataagctt gc 522 

<210> 29 
<211> 168 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Synthetic 



<400> 27 

cagaattcgc tagcctggcg ctcaacttct ctt 



33 
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peptide 
<400> 29 

Met Gly His His His His His His He Glu Gly Arg Leu Lys Leu Leu 
15 10 15 

Asp Asn Trp Asp Ser Val Thr Ser Thr Phe Ser Lys Leu Arg Glu Gin 
20 25 30 

Leu Gly Pro Val Thr Gin Glu Phe Trp Asp Asn Leu Glu Lys Glu Thr 
35 40 > t 45 

Glu Gly Leu Arg Gin Glu Met Ser Lys Asp Leu Glu Glu Val Lys Ala 
50 55 * 60 

Lys Val Gin Pro Tyr Leu Asp Asp Phe Gin Lys Lys Trp Gin Glu Glu 
65 70 75 80 

Met Glu Leu Tyr Arg Gin Lys Val Glu Pro Leu Arg Ala Glu Leu Gin 

85 '90 95 _ 

Glu Gly Ala Arg Gin Lys Leu His Glu Leu Gin Glu Lys Leu Ser Ala 
100 105 110 

Arg Leu Ala Glu Tyr His Ala Lys Ala Thr Glu His Leu Ser Thr Leu 
X15 120 125 

Ser Glu Lys Ala Lys Pro Ala Leu Glu Asp Leu Arg Gin Gly Leu Leu 
130 135 140 

Pro Val Leu Glu Ser Phe Lys Val Ser Phe Leu Ser Ala Leu Glu Glu 
145 150 155 - 160 

Tyr Thr Lys Lys Leu Asn Thr Gin 

165 



<210> 30 
<211> 77 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Synthetic 
nucleotide 

<400> 30 

taccatgggt catcatcatc atcatcacat tgagggacgt ctgaagctgt tggacaattg 60 
ggactctgtt acgtcta 77 

<210> 31 
<211> 62 
<212> DNA 

<213> Artificial Sequence 
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<220> 

<223> Description of Artificial Sequence: Synthetic 
nucleotide 

<400> 31 

aggaattctg ggacaacctg gaaaaagaaa ccgagggact gcgtcaggaa atgtccaaag 60 
at 62 

<210> 32 
<211> 54 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Synthetic 
nucleotide 

<400> 32 

tatctagatg actttcagaa aaaatggcag gaagagatgg aattatatcg tcaa 54 

<210> 33 
<211> 73 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Synthetic 
nucleotide 

<400> 33 

atgagctcca agagaagctc agcccattag gcgaagaaat gcgcgatcgc gcccgtgcac 60 
atgttgatgc act 73 

<210> 34 
<211> 65 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Synthetic 
nucleotide 

<400> 34 

gtctcgaggc gctgaaagaa aacgggggtg cccgcttggc tgagtaccac gcgaaagcga 60 
cagaa 65 

<210> 35 
<211> 56 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Synthetic 
nucleotide 
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<400> 35 

gaagatctac gccagggctt attgcctgtt cttgagagct ttaaagtcag ttttct 56 

<210> 36 
<211> 61 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Synthetic 
nucleotide 

<400> 36 

cagaattcct gcgtcacggg gcccagttgt tcgcgaagtt tactgaaggt agacgtaaca 60 
g 61 

<210> 37 
<211> 55 
<212> DNA 

<213> Artificial Sequence - 
<220> 

<223> Description of Artificial Sequence: Synthetic 
nucleotide 



<400> 37 

tcatctagat atggctgaac cttggccttc acctcttcta aatctttgga cattt 55 

<210> 38 
<211> 80 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Synthetic 
nucleotide 

<400> 38 

tggagctcat ggagtttttg gcgtgccccc tcttgcagtt ccgcacgcag cggttccacc 60 • 
ttttgacgat ataattccat 80 

<210> 39 
<211> 76 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Synthetic 
nucleotide 



<400> 39 

gcctcgagac gtgcggccaa acgctggcga agttcatccg aatacggcgc caaatgagtc 60 
cggagtgcat caacat 76 

<210> 40 
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<211> 61 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Synthetic 
nucleotide 

<400> 40 

gtagatcttc cagcgccggt ttcgcttttt cgctcaaggt gctcaggtgt tctgtcgctt 60 
t 61 

<210> 41 
<211> 66 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description _pf Artificial Sequence; Synthetic 

nucleotide " — 



<400> 41 

ccaagcttat tactgggtat tcagcttttt agtatattct tccagagctg acagaaaact 60 
gacttt ' 66 

<210> 42 
<211> 651 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Synthetic 
nucleotide 

<400> 42 

accatgggtc atcatcatca tcatcacatt gagggacgtc tgaagctgtt ggacaattgg 60 
gactctgtta cgtctacctt cagtaaactt cgcgaacaac tgggccccgt gacgcaggaa 120 
ttctgggaca acctggaaaa agaaaccgag ggactgcgtc aggaaatgtc caaagattta 180 
gaagaggtga aggccaaggt tcagccatat ctagatgact ttcagaaaaa atggcaggaa 24 0 
gagatggaat tatatcgtca aaaggtggaa ccgctgcgtg cggaactgca agagggggca 300 
cgccaaaaac tccatgagct ccaagagaag ctcagcccat taggcgaaga aatgcgcgat 360 
cgcgcccgtg cacatgttga tgcactccgg actcatttgg cgccgtattc ggatgaactt 420 
cgccagcgtt tggccgcacg tctcgaggcg ctgaaagaaa acgggggtgc ccgcttggct 48 0 
gagtaccacg cgaaagcgac agaacacctg agcaccttga gcgaaaaagc gaaaccggcg 54 0 
ctggaagatc tacgccaggg cttattgcct gttcttgaga gctttaaagt cagttttctg 600 
tcagctctgg aagaatatac taaaaagctg aatacccagt aataagcttg g 651 

<210> 43 
<211> 201 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Synthetxc 
peptide 
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<400> 43 

Met Gly His His His His His His lie Glu Gly Arg Leu Lys Leu Leu 
15 20 15 



Asp Asn Trp Asp Ser Val Thr Ser Thr Phe Ser Lys Leu Arg Glu Gin 
20 25 30 

Leu Gly Pro Val Thr Gin Glu Phe Trp Asp Asn Leu Glu Lys Glu Thr 
35 40 45 

Glu Gly Leu Arg Gin Glu Met Ser Pro Tyr Leu Asp Asp Phe Gin Lys 
50 55 60 



Lys Trp Gin Glu Glu Met Glu Leu Tyr Arg Gin Lys Val Glu Pro Leu 
65 70 75 80 

Arg Ala Glu Leu Gin Glu Gly Ala Axg Gin Lys Leu His Glu Leu Gin 

85 _ 90 95 

Glu Lys Leu Ser Pro Leu Gly Glu Glu Met Arg Asp Arg Ala Arg Ala 
100 105 110 



His Val Asp Ala Leu Arg Thr His Leu Ala Pro Tyr Ser Asp Glu Leu 
115 120 125 

Arg Gin Arg Leu Ala Ala Arg Leu Glu Ala Leu Lys Glu Asn Gly Gly 
130 135 140 

Ala Arg Leu Ala Glu Tyr His Ala Lys Ala Thr Glu His Leu Ser Thr 
145 ISO 155 160 

Leu Ser Glu Lys Ala Lys Pro Ala Leu Glu Asp Leu Arg Gin Gly Leu 

165 170 175 



Leu Pro Val Leu Glu Ser Phe Lys Val Ser Phe Leu Ser Ala Leu Glu 
180 185 190 

Glu Tyr Thr Lys Lys Leu Asn Thr Gin 
195 200 

<210> 44 
<211> 201 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence; Synthetic 
peptide 

<400> 44 

Met Gly His His His His His His lie Glu Gly Arg Leu Lys Leu Leu 
1 5 10 15 

Asp Asn Trp Asp Ser Val Thr Ser Thr Phe Ser Lys Leu Arg Glu Gin 
20 25 30 
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Leu Gly Pro Val Thr Gin Glu Phe Trp Asp Asn Leu Glu Lys Glu Thr 
35 40 45 

Glu Gly Leu Arg Gin Glu Met Ser Lys Asp Leu Glu Glu Val Lys Ala 
50 55 60 

Lys Val Gin Pro Tyr Leu Asp Asp Phe Gin Lys Lys Trp Gin Glu Glu 
65 70 75 80 

Met Glu Leu Tyr Arg Gin Lys Val Glu Pro Leu Arg Ala Glu Leu Gin 

85 90 95 

Glu Gly Ala Arg Gin Lys Leu His Glu Leu Gin Glu Lys Leu Ser Pro 
100 105 110 

Leu Gly Glu Glu Met Arg Asp Arg Ala Arg Ala His Val Asp Ala Leu 
115 120 125 

Arg Thr His Leu Ala Pro Tyr Ser Asp Glu Leu Arg Gin Arg Leu Ala 
130 * 135 140 

Ala Arg Leu Glu Ala Leu Lys Glu Asn Gly Gly Ala Arg Leu Ala Glu 
145 150 - 155 160 

Tyr His Ala Lys Ala Thr Glu His Leu Ser Thr Leu Ser Glu Lys Ala 

165 170 175 

Lys Pro Val Leu Glu Ser Phe Lys Val Ser Phe Leu Ser Ala Leu Glu 
180 185 190 

Glu Tyr Thr Lys Lys Leu Asn Thr Gin 
195 200 



<210> 45 
<211> 392 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Synthetic 
peptide 

<400> 45 

Met Gly His His His His His His lie Glu Gly Arg Leu Lys Leu Leu 
15 10 15 

Asp Asn Trp Asp Ser Val Thr Ser Thr Phe Ser Lys Leu Arg Glu Gin 
20 25 30 

Leu Gly Pro Val Thr Gin Glu Phe Trp Asp Asn Leu Glu Lys Glu Thr 
35 40 45 

Glu Gly Leu Arg Gin Glu Met Ser Pro Tyr Leu Asp Asp Phe Gin Lys 
50 55 60 
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Lys Trp Gin Glu- Glu Met Glu Leu Tyr Arg Gin Lys Val Glu Pro Leu 
65 70 75 80 

Arg Ala Glu Leu Gin Glu Gly Ala Arg Gin Lys Leu His Glu Leu Gin 

85 90 95 

Glu Lys Leu Ser Pro Leu Gly Glu Glu Met Arg Asp Arg Ala Arg Ala 
100 105 110 

His Val Asp Ala Leu Arg Thr His Leu Ala Pro Tyr Ser Asp Glu Leu 
115 120 125 

Arg Gin Arg Leu Ala Ala Arg Leu Glu Ala Leu Lys Glu Asn Gly Gly 
130 135 140 

Ala Arg Leu Ala Glu Tyr His Ala Lys Ala Thr Glu His Leu Ser Thr 
145 150 155 160 

Leu Ser Glu Lys Ala- Lys Pro Ala Leu Glu Asp Leu Arg Gin Gly Leu 

1^5 170 175 



Leu Pro Val Leu Glu Ser Phe Lys 
180 

Glu Tyr Thr Lys Lys Leu Asn Thr 
195 200 



Val Ser Phe Leu Ser Ala Leu Glu 
185 190 

Gin Gly Thr Leu Lys Leu Leu Asp 

205 



Asn Trp Asp Ser Val Thr Ser Thr 
210 215 

Gly Pro Val Thr Gin Glu Phe Trp 
225 230 

Gly Leu Arg Gin Glu Met Ser Pro 

245 

Trp Gin Glu Glu Met Glu Leu Tyr 
260 

Ala Glu Leu Gin Glu Gly Ala Arg 
275 280 

Lys Leu Ser Pro Leu Gly Glu Glu 
290 295 



Phe Ser Lys Leu Arg Glu Gin Leu 
220 

Asp Asn Leu Glu Lys Glu Thr Glu 
235 240/ 

Tyr Leu Asp Asp Phe Gin Lys Lys 
250 255 

Arg Gin Lys Val Glu Pro Leu Arg 
265 270 

Gin Lys Leu His Glu Leu Gin Glu 

285 

Met Arg Asp Arg Ala Arg Ala His 
300 



Val Asp Ala Leu Arg Thr His Leu Ala Pro Tyr Ser Asp Glu Leu Arg 
305 310 315 320 

Gin Arg Leu Ala Ala Arg Leu Glu Ala Leu Lys Glu Asn Gly Gly Ala 

325 330 335 

Arg Leu Ala Glu Tyr His Ala Lys Ala Thr Glu His Leu Ser Thr Leu 
340 345 350 



21 



WO 02/40501 PCT/US01/43451 



Ser Glu Lys Ala Lys Pro Ala Leu Glu Asp Leu Arg Gin Gly Leu Leu 
355 360 365 

Pro Val Leu Glu Ser Phe Lys Val Ser Phe Leu Ser Ala Leu Glu Glu 
370 375 380 

Tyr Thr Lys Lys Leu Asn Thr Gin 
385 390 



<210> 46 
<211> 4 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Synthetic 
peptide 

<220> 

<221> VARIANT 
<222> (4) 

<223> X at position 4 is serine or threonine. 

<400> 46 
Gly Gly Gly Xaa 
1 
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Applicants have elected a membrane scaffold protein with amino acid sequence set forth in SEQ ID NO: 17, which is encoded by SEQ 
ID NO: 16 to be searched by this International Searching Authority. 
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